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(57 ) Abstract: The present invention provides nucleic acid molecules, DNA constructs, plasmids, and methods for post-transcrip- 
tional regulation of gene expression using RNA molecules to both repress and activate translation of an open reading frame. Repres- 
sion of gene expression is achieved through the presence of a regulatory nucleic acid element (the cis-repressive RNA or crRNA) 
within the 5' untranslated region (5' UTR) of an mRNA molecule. The nucleic acid element forms a hairpin (stem/loop) structure 
through complementary base pairing. The hairpin blocks access to the mRNA transcript by the ribosome, thereby preventing trans- 
lation. In particular, in embodiments of the invention designed to operate in prokaryotic cells, the stem of the hairpin secondary 
structure sequesters the ribosome binding site (RBS). In embodiments of the invention designed to operate in eukaryotic cells, the 
stem of the hairpin is positioned upstream of the start codon, anywhere within the 5* UTR of an mRNA. A small RNA (trans-ac- 
tivating RNA, or taRNA), expressed in trans, interacts with the crRNA and alters the hairpin structure. This alteration allows the 
ribosome to gain access to the region of the transcript upstream of the start codon, thereby activating transcription from its previously 
repressed state. 



WO 2004/046321 A2 1111011111111011111111110110111 



For two -teller codes and other abbreviations, refer to the "Guid- 
ance Notes on Codes and Abbreviations" appearing at the begin- 
ning of each regular issue of the PCT Gazette. 



)W*> JC13Rec'dPCT/PT0 1 6 MAY 2003 

WO 2004/046321 PCT/US2003/036506 



CIS/TRANS RIBOREGULATORS 

Related Application Information 

[0001] This application claims the benefit of U.S. Provisional Application Ser. No. 
60/426,891, filed November 15, 2002, which is hereby incorporated by reference. 

Government Support 

[0002] This invention was made with Government Support under Grant Number 
F30602-01 -2-0579 awarded by the Air Force Research Laboratory and Grant Number EIA- 
0130331 awarded by the National Science Foundation. The Government has certain rights 
in the invention. 

Background of the Invention 

[0003] Virtually all forms of life exhibit the ability to control gene expression, e.g., in 
response to environmental conditions or as part of the developmental process, and a myriad 
of different mechanisms for controlling gene expression exist in nature. These mechanisms 
permit cells to express particular subsets of genes and allow them to adjust the level of 
particular gene products as required. For example, bacteria and eukaryotic cells are often 
able to adjust the expression of enzymes in synthetic or metabolic pathways depending on 
the availability of substrates or end products. Similarly, many cells are able to induce 
synthesis of protective molecules such as heat shock proteins in response to environmental 
stress. Inherited or acquired defects in mechanisms for control of gene expression are 
believed to play a signficant role in human diseases (e.g., cancer), and targeted disruption of 
important regulatory molecules in mice frequently results in severe phenotypic defects. 
[0004] A number of approaches have been developed in order to artificially control 
levels of gene expression, many of which are modeled on naturally occurring regulatory 
systems. In general, gene expression can be controlled at the level of RNA transcription or 
post-transcriptionally, e.g., by controlling the processing or degradation of mRNA 
molecules, or by controlling their translation. For example, modulating the activity of 
transcription factors (e.g., by adminstration of small molecule activators or inhibitors) is 
being pursued as a method of controlling mRNA levels (see, e.g., Nyanguile O, Uesugi M, 
AustinDJ, Verdine GL. Proc Natl Acad Sci USA. 1997, 94(25): 13402-6. A nonnatural 
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transcriptional coactivator.). Antisense strategies for gene silencing, in which an antisense 
RNA or DNA binds to a target RNA and results in inactivation, are also being actively 
pursued for applications ranging from functional genomics to therapeutics (Giles RV, 
"Antisense oligonucleotide technology: from EST to therapeutics" Curr Opin Mol TJier. 
2000, 2(3):238-52). Nucleic acid enzymes such as ribozymes, i.e., RNA molecules that 
exhibit the ability to cleave other RNA molecules in a sequence-specific manner, offer 
another method for regulating gene expression (Sioud M., "Nucleic acid enzymes as a novel 
generation of anti-gene agents", Curr Mol Med. 2001, l(5):575-88). More recently, the 
discovery of RNA interference (RNAi), in which the presence of double-stranded RNA 
leads to degradation of a target RNA transcript, has provided another approach to the control 
of gene expression (Hutvagner, G. and Zamore, PD., "RNAi: nature abhors a double- 
strand", Curr. Op. Genet Dev., 12:225-232, 2002). 

[0005] Although the approaches described above have proven extremely valuable, they 
have a variety of features that limit their usefulness. For example, methods that involve 
alterations in RNA transcription may have slower response times than methods that are 
based on post-transcriptional regulation. Techniques involving modulation of transcription 
factors are generally limited to well-characterized transcription factors. Antisense, 
ribozyme, and RNAi-based approaches typically require sequence-specific design. It is 
evident that a need exists in the art for additional systems and methods for the control of 
gene expression. In particular, there exists a need for modular systems that function with a 
wide variety of genes and that can be integrated into biological networks. Furthermore, 
there exists a need in the art for systems that would afford the ability to artificially control 
gene expression within cells in response to external stimuli. 

Summary of the Invention 

[0006] The present invention addresses these needs, among others, by providing systems 
and methods for the post-transcriptional control of gene expression in prokaryotic or 
eukaryotic cells. The invention provides an artificial RNA-based system that enables 
precise control through highly specific RNA-RNA interactions. According to the invention 
effective repression is achieved by engineering an RNA molecule (or template for the RNA 
molecule), so that the engineered RNA forms a secondary structure that prevents the 
ribosome from gaining access to the RNA at an appropriate location to begin translation. 
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Repression of gene expression is achieved through the presence of a regulatory nucleic acid 
element (the ciy-repressive RNA or crRNA) within the 5 9 untranslated region (5' UTR) of 
an mRNA molecule. The nucleic acid element forms a hairpin (stem/loop) structure through 
complementary base pairing. The hairpin blocks access to the mRNA transcript by the 
ribosome, thereby preventing translation. A small RNA (fra/w-activating RNA, or taRNA), 
expressed in trans, interacts with the crRNA and alters the hairpin structure. This alteration 
allows the ribosome to gain access to the region of the transcript upstream of the start codon, 
thereby activating transcription from its previously repressed state. 
[0007] In one aspect, the invention provides an engineered nucleic acid molecule 
comprising: (i) a first stem-forming portion; (ii) a second stem-forming portion, wherein the 
two stem-forming portions are complementary or substantially complementary, and (iii) a 
non-stem-forming portion that forms a loop connecting the 3' end of the first stem-forming 
portion and the 5' end of the second stem-forming portion, wherein the engineered nucleic 
acid molecule forms a stem-loop structure that represses translation when positioned 
upstream of an open reading frame (ORF). When present as RNA, the nucleic acid 
molecule is referred to as a ds-repressive RNA (crRNA). The invention further provides 
DNA contracts and plasmids that comprise templates for transcription of a crRNA as well as 
cells comprising crRNA elements, DNA constructs, and plasmids. 
[0008] In another aspect, the invention provides an engineered nucleic acid molecule 
comprising: (i) a first stem-forming portion; (ii) a second stem-forming portion; and (iii) a 
non-stem-forming portion, wherein the non-stem-forming portion connects the 3' end of the 
first stem-forming portion and the 5' end of the second stem-forming portion to form a loop, 
and wherein a portion of the nucleic acid molecule is complementary or substantially 
complementary, to a portion of a cognate cw-repressive nucleic acid molecule. When 
present as RNA, the nucleic acid molecule is referred to as a ta2/w-activating RNA (taRNA). 
The taRNA interacts with a cognate crRNA to derepress transation that is repressed by the 
crRNA. The invention further provides DNA contracts and plasmids that comprise 
templates for transcription of a taRNA as well as cells comprising taRNA elements, DNA 
constructs, and plasmids. 

[0009] In addition, the invention provides a system for control of gene expression 
comprising: (i) a first nucleic acid molecule comprising a czs-repressive sequence element 
upstream of an open reading frame (ORF), wherein the first nucleic acid molecule forms a 
stem-loop structure that represses translation of the ORF; and (ii) a second nucleic acid 
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molecule comprising first and second stem-forming portions and a non-stem-forming 
portion, wherein the non-stem-forming portion connects the 3' end of the first stem-forming 
portion and the 5' end of the second stem-forming portion to form a loop, and wherein a 
portion of the second nucleic acid molecule is complementary or substantially 
complementary to a portion of the first nucleic acid molecule and interacts with the first 
nucleic acid molecule to derepress translation of the ORF. 

[0010] In another aspect, the invention provides a method of regulating translation of an 
open reading frame comprising: (i) introducing an engineered template for transcription of 
an mRNA into a cell and allowing mRNA trancription to occur resulting in a transcribed 
mRNA, wherein the template is engineered so that the transcribed mRNA comprises first 
and second nucleic acid elements that form a stem-loop structure that represses translation of 
the mRNA; and (ii) providing an engineered nucleic acid molecule that interacts with the 
mRNA so as to derepress translation of the mRNA to the cell. 

[0011] In certain embodiments of the invention the engineered template comprises: (i) a 
first stem-forming portion; (ii) a second stem- forming portion, wherein the two stem- 
forming portions are complementary or substantially complementary, and (iii) a non-stem- 
forming portion that forms a loop connecting the 3' end of the first stem-forming portion 
and the 5' end of the second stem-forming portion, wherein the engineered nucleic acid 
molecule forms a stem-loop structure that represses translation when positioned upstream of 
an open reading frame (ORF). In certain embodiments of the invention the engineered 
nucleic acid molecule comprises: (i) a first stem-forming portion; (ii) a second stem-forming 
portion; and (iii) a non-stem-forming portion, wherein the non-stem-forming portion 
connects the 3' end of the first stem-forming portion and the 5' end of the second stem- 
forming portion to form a loop, and wherein a portion of the nucleic acid molecule is 
complementary or substantially complementary, to a portion of the transcribed mRNA. 
[0012] In another aspect, the invention provides a method of selecting a cognate pair of 
nucleic acid molecules for regulating translation comprising steps of: (i) providing one or 
more starting nucleic acid sequences; (ii) randomizing the sequence or sequences to generate 
one or more pools of randomized nucleic acid sequences; and (iii) employing in vitro 
selection to identify a candidate cognate nucleic acid pair comprising a repressive element 
that represses translation when positioned upstream of an ORF and an activating element 
that derepresses translation that is repressed by the candidate repressive element. 
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[0013] This application refers to various patents and publications. The contents of all of 
these are incorporated by reference. In addition, the following publications are incorporated 
herein by reference: Current Protocols in Molecular Biology, Current Protocols in 
Immunology, Current Protocols in Protein Science, and Current Protocols in Cell Biology, 
all John Wiley & Sons, N.Y., edition as of July 2002; Sambrook, Russell, and Sambrook, 
Molecular Cloning: A Laboratory Manual, 3 rd ed. Cold Spring Harbor Laboratory Press, 
Cold Spring Harbor, 2001. 

Brief Description of the Drawing 

[0014] Figure 1A illustrates the artificial riboregulator system used to control post- 
transcriptional regulation. Basic steps of native prokaryotic gene expression are illustrated 
in the box. A promoter, P, drives the expression of a gene (GFP). After transcription, 
mRNA is present with a ribosome binding site (RBS) available for docking by a ribosome. 
After ribosome binding, translation of a functional protein occurs. 
[0015] Figure IB schematically illustrates the functioning of the m-repressive and 
fra/is-activating riboregulators in a prokaryotic system. 

[001 6] Figure 2 A illustrates M-fold (3 5) predicted secondary structures of four crRNA 
variants (in ascending gray scale) and of a control RNA with an arbitrary sequence in place 
of the ay-repressive sequence. crRL (lightest), crR7, crRlO, crRB , and control (darkest). 
The ribosome binding sites (RBS) are boxed; YUNR recognition motif of loops in light 
grey; cis-repressive (cr) sequences in italics; start codons (AUG) in bold. 
[0017] Figures 2B and 2C show czs-repression results of crRNA variants (in ascending 
gray scale): crRL (lightest), crR7, crRlO, crRB, and control (darkest, labeled +C). Flow- 
cytometric results of crRNA variants driving the expression oigfpmut3b at intermediate 
(Figure 2B) and high (Figure 2Q transcription rates are shown. Histograms represent GFP 
expression of cultures containing each construct in Figure 2 A. crRL (lightest), crR7, crRlO, 
crRB , and control (darkest, labeled +C). Negative control curve (-C) corresponds to 
fluorescence measurement cells containing plasmids that lack GFP (autofluorescence 
measurement). 

[0018] Figures 3A and 3B show M-fold predicted (35) structures of taR12 (Figure 3A) 
and crR12 (Figure 3B) variants using the same scheme as Figure 2. As in Figure 2A, the 
ribosome binding site (RBS) is boxed; YUNR recognition motif of loop in light grey; cis- 
repressive (cr) sequence in italics; start codon (AUG) in bold for crRNA. 
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[0019] Figure 3C shows a schematic representation of the proposed mechanism for 
artificial ribos witch. The YUNR motif (medium grey) of taRNA targets its complementary 
sequence (medium grey) on crRNA. The linear-loop complex formed by taRNA-crRNA 
interaction, destabilizes the hairpin stem-loop which obstructs ribosomal recognition of 
RBS (light grey). The resulting RNA duplex exposes the RBS and allows translation to 
occur, (cis-repressive sequence in dark grey; start codon (AUG) noted). 
[0020] Figure 3D (trans-activation results) shows flow-cytometric results for taRlO- 
crRlO riboregulator system. Autofluorescence measurement (- C) (cells lacking GFP) in 
black and GFP expression of control (+ C) (cells without cis-sequence) in light grey. 
Intermediate grey curves depict cis-repressed cultures (labeled crRlO, no arabinose) and 
cells containing high levels of taRNA (labeled taRlO, 0.25% arabinose). 
[0021] Figure 3E (trans-activation results) shows flow-cytometric results for taR12- 
crR12 riboregulator system. Autofluorescence measurement (- C) (cells lacking GFP) in 
black and GFP expression of control (+ C) (cells without cis-sequence) in light grey. 
Intermeidate grey curves depict cis-repressed cultures (labeled crR12, no arabinose) and 
cells containing high levels of taRNA (labeled taR12, 0.25% arabinose). 
[0022] Figure 3F (trans-activation results) shows normalized dose-response curves of 
taRlO-crRlO (solid line) and taR12-crR12 (dotted line) riboswitches at corresponding 
concentrations of arabinose. 

[0023] Figure 3G (trans-activation results) shows flow-cytometric (grey & black bars) 
and taRNA concentration (white & striated bars) results of four riboregulator variants (taRL- 
-crR12, taR7-crR12, taR10-crR12, taR12-crR12) at low (grey and white, respectively) 
and high (black and striated, respectively) arabinose concentrations. All data presented is 
normalized to high GFP and RNA levels. 

[0024] Figure 3H (trans-activation results) shows a schematic representation of 
sequence specific taR12 — crR12 stable duplex. The 5' linear sequence of taRNA targets its 
complementary YUNR motif sequence (boxed, light grey) on crRNA. In the presence of 
taR12, the cis sequence (italicized) is destabilized and forms a stable taR12 — crR12 duplex 
(shown). The resulting duplex exposes the RBS (boxed, black) and allows translation to 
occur, (start codon (AUG) in bold, allowed G-U mispairings marked by black dots) 
[0025] Figure 4 shows a variety of stem-loop (hairpin) structures. Lines extending from 
one horizontal strand to another indicate base pairs. Lines that do not extend from one 
strand to the other indicate unpaired nucleotides. 
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[0026] Figure 5 shows the main set of plasmids used in the artificial riboregulator 
system. Cis-repressive RNA plasmids, pZE21alpha_G (i.e., pZE21alphaLG, see Table 4), 
contain PL(tetO) producing the cr sequence, loop, RBS and gfpmutib gene. Trans- 
activating RNA plasmids, pZE15Y_ (i.e., pZE15YL), contain the pBAD promoter 
expressing taRNA. In plasmids containing Riboregulator System I, pZE21alpha_G and 
pZE15Y_ were combined to form pZER21Y_alpha_G. All plasmids contained the ColEl 
origin of replication and gene coding for either ampicillin or kanamycin resistance. In 
Riboregulator System II, the PL(tetO) and pBAD promoters were replaced with the 
PL(lacO) and PL(tetO) promoters, respectively. See Table 4 for complete list of plasmids. 
[0027] Figure 6 shows reverse transcription profiles of complexes between crR7 
(upper), crRlO (middle), crR12 (lower), and taR7, taRlO, and taR12, as indicated with 
arrows. The peaks at 165-185 min and 210-230 min correspond to truncated transcripts 
and fiill length transcripts, respectively. All taRNA-crRNA pairings are denoted by 
arrows. 

[0028] Figure 7 shows reverse transcription profiles of the taR7— crR12 pair. The 
concentration of crR12 is kept constant (0,01 jiM); the concentration of taR7 decreases over 
lanes 1-6. Lanes represent (from top to bottom) 1.0 jjlM, 0.5 |iM, 0.25 ^M, 0.13 jiM, 0.06 
juM, and 0.03 jiM, respectively. The peaks are: 92 min ~ RT primer; 170-180 min - 
termination on cis-repressive secondary structure; 180-190 min - termination on the 
crRNA-taRNA complex; 210-220 min - termination on secondary structure (minor); 240 
min — full length reverse transcript. 

[0029] Figure 8 shows determination of equilibrium association and dissociation 
constants for the taR7~crR12 pair. Here, CR and TA denote the initial concentrations of 
crR12 and taR7, respectively; x=Sc/(S c +Sf), where S c and Sf are the peak areas of the 
complex and the full length transcript. The equation of the linear regression line is TA-x°CR 
= 1.03 *x/(l-x) - 0.008; K D is 1.03 ^M, K A is 9.7xl0 5 M" ! . 

[0030] Figures 9 A - 9D shows GFP expression results using an additional cis element. 
RBS is ribosome binding site, a refers to a first c/s-repressive element, p is the additional 
element, a is complementary to both RBS and (3 and can bind to either sequence. 
[0031] Figure 9A shows flow cytometry (expression) results for pZE21G (control). 
[0032] Figure 9B shows flow cytometry (expression) results for pZE21pG (control), 
where p~ cis sequence is inserted directly upstream of ribosome binding site (RBS) 
sequence). This results in a high expression state. 
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[0033] Figure 9C shows flow cytometry (expression) results for the structure with one 
cis-repressive element (pZE21alOLG), resulting in a low expression state. 
[0034] Figure 9D shows flow cytometry (expression) results for a structure in which the 
mRNA transcript contains both a and (i elements (pZE21pal0LG), exhibiting an 
intermediate expression state. 

Detailed Description of Certain Preferred Embodiments of the Invention 
[0035] I. Definitions 

[0036] The following definitions are of use in understanding the invention. 
[0037] Approximately: As used herein, the terms approximately or about in reference to 
a number are generally taken to include numbers that fall within a range of 5% in either 
direction (greater than or less than) the number unless otherwise stated or otherwise evident 
from the context (except where such number would exceed 100% of a possible value). 
Where ranges are stated, the endpoints are included within the range unless otherwise stated 
or otherwise evident from the context. 

[0038] Artificial Engineered, Synthetic: A nucleic acid molecule is referred to herein as 
"artificial", "engineered", or "synthetic" if it has been created or modified by the hand of 
man (e.g., using recombinant DNA technology) or is derived from such a molecule (e.g., by 
transcription, translation, etc.) A nucleic acid molecule may be similar in sequence to a 
naturally occurring nucleic acid but typically contains at least one artificially created 
insertion, deletion, inversion, or substitution relative to the sequence found in its naturally 
occurring counterpart. A ceil that contains an engineered nucleic acid is considered to be an 
engineered cell. 

[0039] Complementarity: For purposes of the present invention, complementarity of two 
sequences is determined by dividing the total number of nucleotides that participate in 
complementary base pairs (GC, AU, AT) when the sequences are aligned to produce the 
maximum number of complementary base pairs, counting all nucleotides in the two 
sequences including those in bulges, mismatches, or inner loops by the total number of 
nucleotides contained in both sequences. For example, consider two sequences of 19 and 20 
nucleotides in length in which alignment to produce the maximum number of 
complementary base pairs results in 16 base pairs, 1 inner loop of 2 nucleotides, 1 mismatch, 
and 1 bulge (in the sequence with 20 nucleotides). The percent complementarity of the two 
sequences is [(16+17)739] 100. It will be appreciated that complementarity may be 
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determined with respect to the entire length of the two sequences or with respect to portions 
of the sequences. 

[0040] Gene: For the purposes of the present invention, the term "gene" has its meaning 
as understood in the art. In general, a gene is taken to include gene regulatory sequences 
.(e.g., promoters, enhancers, etc.) and/or intron sequences, in addition to coding sequences 
(open reading frames). It will further be appreciated that definitions of "gene" include 
references to nucleic acids that do not encode proteins but rather encode functional RNA 
molecules such as tRNAs. For the purpose of clarity we note that, as used in the present 
application, the term "gene" generally refers to a portion of a nucleic acid that encodes a 
protein; the term may optionally encompass regulatory sequences. This definition is not 
intended to exclude application of the term "gene" to non-protein coding expression units 
but rather to clarify that, in most cases, the term as used in this document refers to a protein 
coding nucleic acid. 

[0041] Gene product or expression product: A "gene product" or "expression product" 
is, in general, an RNA transcribed from the gene or a polypeptide encoded by an RNA 
transcribed from the gene. Thus a regulatory element, environmental condition, stimulus, 
etc., that alters the level of transcription or the stability of an RNA transcribed from a gene 
or alters its ability to serve as a template for translation will be said to alter expression of the 
gene. Similarly, a regulatory element, environmental condition, stimulus, etc., that alters the 
level of translation or stability of a polypeptide translated from an RNA transcribed from the 
gene will be said to alter expression of the gene. 

[0042] Hairpin: A "hairpin" or "stem/loop" structure as used herein refers to a single 
nucleic acid molecule or portion thereof that includes a duplex (double helical) region (the 
stem) formed when complementary regions within the molecule hybridize to each other via 
base pairing interactions and further includes a single-stranded loop at one end of the 
duplex. Figures 4A-4D show various stem-loop structures. In various embodiments of the 
invention the double-stranded portion may include one or more mismatches, bulges, or inner 
loops. For purposes of description herein, the length of a stem is measured from the first 
pair of complementary nucleotides to the last pair of complementary bases and includes 
mismatched nucleotides (e.g., pairs other than AT, AU, GC), nucleotides that form a bulge, 
or nucleotides that form an inner loop. 

[0043] It is noted that although a hairpin is formed from a single nucleic acid molecule, 
the two portions of the molecule that form the duplex portion of the hairpin, i.e., the stem, 
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will be referred to herein as "strands". Thus the stem may be referred to herein as the 
double-stranded portion of the hairpin. Nucleic acid molecules containing complementary 
regions that form a duplex are said to be "self-complementary" or to "self-hybridize". In 
general, the hairpin and intermolecular duplexes described herein form at and are stable 
under physiological conditions, e.g., conditions present within a cell (e.g., conditions such as 
pH, temperature, and salt concentration). Such conditions include a pH between 6.8 and 7.6, 
more preferably approximately 7.4. Typical temperatures are approximately 37°C, although 
it is noted that prokaryotes and certain eukaryotic cells such as fungal cells can grow at 
lower (or, in some cases, higher) temperatures. 

[0044] As mentioned above, the stem may include one or more areas of non- 
complementarity, e.g., one or more mismatches, bulges, inner loops, or combinations of the 
foregoing. A mismatch occurs when the two strands include a single non-complementary 
nucleotide at corresponding positions that interrupt the continuity of the double-stranded 
portion (see Figure 4B) A bulge occurs when one of the two strands includes one or more 
"extra" nucleotide(s) that do not base pair with nucleotide(s) on the other strand but are 
surrounded by regions of double-strandedness (see, e.g., Figure 4C). An inner loop occurs 
when two or more consecutive mismatches exist in a stem, i.e., there are distinct 
complementary base pairs on either side of the inner loop (see, e.g., Figure 4D). An inner 
loop is to be distinguished from a "main loop" in that in the case of an inner loop, the two 
strands of the loop connect distinct base pairs, whereas in the case of a main loop, the single 
strand of the loop connects the two nucleotides of a single base pair. Various combinations 
of these types of areas of non-complementarity can also exist (see, e.g., Figure 4E). 
[0045] Isolated: As used herein, "isolated" means 1) separated from at least some of the 
components with which it is usually associated in nature; 2) prepared or purified by a 
process that involves the hand of man; and/or 3) not occurring in nature. The nucleic acid 
molecules of the invention may be isolated nucleic acid molecules. 
[0046] Nucleic acid molecule: "Nucleic acid molecule" or "polynucleotide" refers to a 
polymer of nucleotides joined by phosphodiester bonds. The term includes deoxyribonucleic 
acids (DNA) and ribonucleic acids (RNA), including messenger RNA (mRNA), transfer 
RNA (tRNA), etc. Typically, a nucleic acid molecule comprises at least three nucleotides. 
Nucleic acid molecules may be single stranded, double stranded, and also tripled stranded. 
A double stranded nucleic acid may comprise two separate strands of nucleic acid 
hybridized to each other through hydrogen bond-mediated base pairing interactions. A 
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double stranded nucleic acid may also comprise two regions of a single nucleic acid 
molecule that hybridize to each other to form secondary structure, e.g, a stem in a stem-loop 
(hairpin) structure. 

[0047] A nucleotide consists of a nucleoside, i.e., a nitrogenous base linked to a pentose 
sugar, and one or more phosphate groups which is usually esterified at the hydroxyl group 
attached to C-5 of the pentose sugar (indicated as 5') of the nucleoside. Such compounds are 
called nucleoside 5'-phosphates or 5 f -nucleotides. In a molecule of DNA the pentose sugar is 
deoxyribose, whereas in a molecule of RNA the pentose sugar is ribose. The nitrogenous 
base can be a purine such as adenine or guanine, or a pyrimidine such as cytosine, thymine 
(in deoxyribonucleotides) or uracil (in ribonucleotides). Thus, the major nucleotides of DNA 
are deoxyadenosine S'-triphosphate (dATP), deoxyguanosine 5 -triphosphate (dGTP), 
deoxycytidine 5-triphosphate (dCTP), and deoxythymidine 5 '-triphosphate (dTTP). The 
major nucleotides of RNA are adenosine 5-triphosphate (ATP), guanosine 5 -triphosphate 
(GTP), cytidine 5 '-triphosphate (CTP) and uridine S'-triphosphate (UTP). In general, stable 
base pairing interactions occur between adenine and thymine (AT), adenine and uracil (AU), 
and guanine and cytosine (GC). Thus adenine and thymidine, adenine and uracil, and 
guanine and cytosine (and the corresponding nucleosides and nucleotides) are referred to as 
complementary. 

[0048] In general, one terminus of a nucleic acid molecule has a 5-hydroxyl group and 
the other terminus of the molecule has a 3'-hydroxyl group; thus the nucleotide chain has a 
polarity. By convention, the base sequence of a nucleic acid molecule is written in a 5 ! to 3' 
direction, which is also the direction in which RNA transcription occurs. Thus in general a 
DNA sequence presented herein will have the same sequence as an RNA transcribed using 
the DNA as a template, i.e., the sequence of the non-template DNA strand will be given. 
[0049] In various embodiments of the invention a nucleic acid molecule may include 
nucleoside analogs (e.g., 2-aminoadenosine, 2-thiothymidine, inosine, pyiTolo-pyrimidine, 
3-methyl adenosine, C5-propynylcytidine, C5-propynyluridine, C5-bromouridine, 
C5-fluorouridine, C5-iodouridine, C5-methylcytidine, 7-deazaadenosine, 7-deazaguanosine, 
8-oxoadenosine, 8-oxoguanosine, 0(6)-methylguanine, and 2-thiocytidine), chemically 
modified bases, biologically modified bases (e.g., methylated bases), intercalated bases, 
modified sugars (e.g., 2'-fluororibose, ribose, 2 '-deoxyribose, arabinose, and hexose), or 
modified phosphate groups (e.g., phosphorothioates and 5'-N-phosphoramidite linkages). 
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[0050] A nucleic acid molecule or portion thereof may also be referred to as a "nucleic 
acid segment* a ''nucleic acid element", or a "nucleic acid sequence". 
[0051] Operably linked: As used herein, "operably linked" refers to a relationship 
between two nucleic acid sequences wherein the expression of one of the nucleic acid 
sequences is controlled by, regulated by, modulated by, etc., the other nucleic acid sequence. 
For example, the transcription of a nucleic acid sequence is directed by an operably linked 
promoter sequence; post-transcriptional processing of a nucleic acid is directed by an 
operably linked processing sequence; the translation of a nucleic acid sequence is directed 
by an operably linked translational regulatory sequence; the transport or localization of a 
nucleic acid or polypeptide is directed by an operably linked transport or localization 
sequence; and the post-translational processing of a polypeptide is directed by an operably 
linked processing sequence. Preferably a nucleic acid sequence that is operably linked to a 
second nucleic acid sequence is covalently linked, either directly or indirectly, to such a 
sequence, although any effective three-dimensional association is acceptable. 
[0052] Purified: As used herein, "purified" means separated from many other 
compounds or entities. A compound or entity may be partially purified, substantially 
purified, or pure, where it is pure when it is removed from substantially all other compounds 
or entities, i.e., is preferably at least about 90%, more preferably at least about 91%, 92%, 
93%, 94%, 95%, 96%, 97%, 98%, 99%, or greater than 99% pure. 

[0053] Regulatory sequence or element: The term regulatory sequence is used herein to 
describe a region of nucleic acid sequence that directs, enhances, or inhibits the expression 
(e.g., transcription, translation, processing, etc.) of sequence(s) with which it is operatively 
linked. The term includes promoters, enhancers and other transcriptional control elements. 
The term additionally encompasses the cis and trans riboregulators of the invention. In 
some embodiments of the invention, regulatory sequences may direct constitutive expression 
of a nucleotide sequence; in other embodiments, regulatory sequences may direct tissue- 
specific and/or inducible or repressible expression. 

[0054] Small molecule: As used herein, the term "small molecule" refers to organic 
compounds, whether naturally-occurring or artificially created (e.g., via chemical synthesis) 
that have relatively low molecular weight and that are not proteins, polypeptides, or nucleic 
acids. Typically, small molecules have a molecular weight of less than about 1500 g/mol. 
Also, small molecules typically have multiple carbon-carbon bonds. 
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[0055] Substantially complementary: Two sequences are considered "substantially 
complementary" herein if their complementarity is at least 50%. 

[0056] Vector: In general, the term vector refers to a nucleic acid molecule capable of 
mediating entry of, e.g., transferring, transporting, etc., a second nucleic acid molecule into a 
cell. The transferred nucleic acid is generally linked to, e.g., inserted into, the vector nucleic 
acid molecule. A vector may include sequences that direct autonomous replication, or may 
include sequences sufficient to allow integration into host cell DNA. Useful vectors include, 
for example, plasmids (typically DNA molecules although RNA plasmids are also known), 
cosmids, and viral vectors. 
• [0057] II. Overview 

[0058] Traditionally, most RNA molecules have been thought to be critical messengers 
of information from genes to the proteins they encode (1, 2). RNA also serves in other 
diverse roles within the cell, namely protein synthesis, RNA splicing and editing, rRNA 
modification, and more (1, 2). In addition, small RNAs (sRNA) can act as ribozymes (3-5), 
in which RNA catalyzes biochemical reactions, and as regulators that control the translation 
and degradation of messengers. These sRNAs, or noncoding RNAs (ncRNA), are involved 
in various structural, regulatory, and enzymatic capacities (6). Noncoding RNAs, which 
likely operate as key regulators in prokaryotic and eukaryotic cellular networks, were first 
identified in studies describing the plasmid-encoded antisense RNAs in bacteria (7, 8) and 
developmental mutants in Caenorhabditis elegans (9). It has recently been shown that RNA 
sequences can act as environmental sensors of vitamin cofactors and temperature, enabling 
them to directly regulate gene expression (10-16). In general, regulatory RNAs act by using 
base complementarity or sensing environmental cues to either repress or, more rarely, 
activate (17) translatioa Such natural mechanisms, which target post-transcriptional 
regulation, provide a basis for the development of synthetic RNA regulators (riboregulators). 
[0059] Its diverse structure, mode of action, and broad utility in nature contribute to the 
multifaceted abilities of RNA, particularly its role as a regulator of cell behavior. In vitro 
selection of nucleic acids has yielded novel molecules that exhibit desired catalytic, 
structural, and complementary base pairing properties (18-25). By exploiting these 
attributes, RNA can be used to direct complex interactions, such as the ability to control a 
target gene. Numerous strategies of RNA-mediated silencing of gene expression have been 
used in prokaryotes, involving gene knockout techniques, deletions, point mutations (26-29), 
and an antisense-based technology that identifies gene targets for antibiotic discovery (30). 
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The present invention utilizes RNA's versatility to control post-transcriptional gene 
regulation through both repression and activation. 

[0060] The present invention provides systems and methods for the post-transcriptional 
control of gene expression in prokaryotic or eukaryotic cells. The invention provides an 
artificial RNA-based system that enables precise control through highly specific RNA-RNA 
interactions. In contrast to existing engineered post-transcriptional schemes in bacteria, 
where repression is achieved through antisense RNA or /ra/tf-acting ribozymes (31, 32), 
according to the present invention effective repression is achieved by engineering an RNA 
molecule (or template for the RNA molecule), so that the engineered RNA forms a 
secondary structure that prevents the ribosome from gaining access to the RNA at an 
appropriate location to begin translation. 

[0061] The invention employs RNA molecules both as gene silencers and activators. 
Repression of gene expression is achieved through the presence of a regulatory nucleic acid 
element (the cw-repressive RNA or crRNA) within the 5' untranslated region (5' UTR) of 
an mRNA molecule. The nucleic acid element forms a hairpin (stem-loop) structure through 
complementary base pairing. (See Figure 4 for examples of various stem-loop structures). 
The hairpin blocks access to the mRNA transcript by the ribosome, thereby preventing 
translation. In particular, in embodiments of the invention designed to operate in 
prokaryotic cells, the stem of the hairpin secondary structure sequesters the ribosome 
binding site (RBS). In embodiments of the invention designed to operate in eukaryotic cells, 
the stem of the hairpin is positioned upstream of the start codon, anywhere within the 5* 
UTR of an mRNA. 

[0062] According to the invention a small RNA (/ra/w-activating RNA, or taRNA), 
expressed in trans, interacts with the crRNA and alters the hairpin structure. This alteration 
allows the ribosome to gain access to the region of the transcript upstream of the start codon, 
thereby activating transcription from its previously repressed state. Corresponding pairs of 
crRNA and taRNA elements (i.e., pairs in which the taRNA interacts with the crRNA to 
relieve repression of translation) are referred to as cognate pairs. In general, such cognate 
pairs include complementary or, preferably, substantially complementary portions at least 6 
nucleotides in length, preferably between 6 and 50 nucleotides in length, e.g., between 12 
and 40 nucleotides in length, between 20 and 30 nucleotides in length, inclusive. In order to 
facilitate understanding of the invention, the following section briefly describes certain 
aspects of the process of gene expression in prokaryotes and eukaryotes. The design and 
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features of the m-repressive and /ra/w-activating nucleic acid molecules of the invention are 
then described in further detail. 

[0063] HI. Translation in Prokarvotes and Eukarvotes 

[0064] Figure 1 A illustrates the basic steps of native prokaryotic gene expression (65). 
As shown in the figure, a promoter, P, drives the expression of a gene (the gene that encodes 
GFP is used for illustrative purposes). In prokaryotes, gene expression begins when RNA 
polymerase recognizes and binds to sequences (-35 and -10 consensus sequences) present in 
a promoter region of the bacterial DNA (or in an extrachromosomal element such as a DNA 
plasmid). Transcription begins at the start site, which is located a short distance downstream 
of (i.e., in the 3' direction from) the binding site and generally terminates when a stop 
(termination) signal is encountered. After transcription, mRNA is present with a ribosome 
binding site (RBS) available for docking by a ribosome in the 5' UTR Naturally occurring 
ribosome binding sites typically comprise a sequence referred to as the Shine-Dalgarno 
sequence typically about six nucleotides long (although shorter and longer sequences exist), 
which can occur at several places in the same mRNA molecule. These sequences are 
generally located four to seven nucleotides upstream from a start codon, and they form base 
pairs with a specific region of the rRNA in a ribosome to signal the initiation of protein 
synthesis at this nearby start codon. The small (30S) ribosomal subunit recognizes the RBS 
and forms an initiation complex along with a tRNA having an anticodon (e.g., UAC) 
complementary to the start codon (e.g., AUG) and carrying an altered form of the amino 
acid methionine (N-formylmethionine, or f-Met) and protein initiation factors. The 
initiation process establishes the correct reading frame for synthesis of a functional protein. 
Activity of a RBS may be influenced by the length and nucleotide composition of the spacer 
which separates the RBS and the initiator (AUG) (62). Prokaryotic mRNAs may contain 
multiple ribosome binding sites, each upstream of a start codon, resulting in synthesis of a 
number of different polypeptides from a single mRNA (i.e., the mRNA is polycistronic). 
[0065] Following binding of the small ribosomal subunit, initiation factors that were 
associated with the small ribosomal subunit depart, and a large (SOS) ribosomal subunit 
attaches to form the 70S ribosome. Since the initiator tRNA molecule is bound to the 
ribosome, synthesis of a protein chain can commence with the binding of a second 
aminoacyl-tRNA molecule to the ribosome. As new peptide bonds are formed in the 
elongation phase of protein synthesis, the ribosome moves along the mRNA, making way 
for entry of the next ribosome upstream of the start codon. Elongation typically continues 
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until the ribosome encounters a stop codon, at which point the ribosome releases the mRNA 
and disocciates. 

[0066] Protein synthesis in eukaryotes occurs by a broadly similar process, with some 
significant differences. Eukaryotic mRNAs typically undergo a variety of modifications in 
the nucleus prior to exit into the cytoplasm. In particular, most eukaryotic mRNAs are 
modifed by the addition of a "cap" structure composed of a 7-methylguanosine residue 
linked to a triphosphate at the 5' end. This 5' cap structure plays an important role in 
protein synthesis. Unlike the case in prokaryotes, where correct positioning of the small 
ribosomal subunit depends on binding to the RBS, in eukaryotic cells the small ribosomal 
subunit first binds at the 5 ' end of an mRNA chain in a process that involves recognition of 
the 5' cap. The small subunit then moves along the mRNA in a 3' direction, searching for 
an AUG codon. Typically the first AUG codon is selected, although a few nucleotides in 
addition to the AUG are also important for the selection process. Although the most 
efficiently used AUG triplets are embedded within a sequence (referred to as a Kozak 
consensus sequence) such as ACC AUG G or GCCG/ACC AUG C (SEQ ID NO:l) (the 
initiation codon is underlined) almost any AUG can be used (55-61). 
[0067] In most cases, once a start codon near the 5' end of an mRNA has been selected, 
downstream AUGs will not serve as sites for the initiation of protein synthesis unless the 
mRNA contains an internal ribosome binding site (IRES). However, an IRES positioned 5' 
to an additional coding sequence directs the co-translation of multiple open reading frames 
(ORF) from a single polycistronic RNA message. Briefly, IRES are cis-acting elements that 
recruit the small ribosomal subunits to an internal initiator codon in the mRNA with the aid 
of cellular trans-acting factors (for a review, see 52). A polycistronic message having 
correctly positioned IRES sequences directs the co-translation of multiple ORFs in a 
polycistronic mRNA. 

[0068] IV. Design of Czs-repressive Sequence and Cfc-repressive RNA Elements 
[0069] This section describes the design of cw-repressive sequences and RNA elements 
that contain them (as-repressive RNA) and the construction of templates for their synthesis. 
For purposes of convenience in the description, references to nucleic acid elements such as 
start codons, ribosome binding site, 5' UTR, stem-loop, etc., may refer to either the RNA 
form or to the DNA form (i.e., to a DNA molecule that provides a template for transcription 
of the RNA). Similarly, when reference is made to modifying an RNA (e.g., by inserting an 
element such as a c/s-repressive sequence) into the RNA, it is to be understood that the 
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modification is generally accomplished by engineering the appropriate modification to a 
DNA molecule that provides a template for transcription of the RNA. 
[0070] In both prokaryotic and eukaryotic systems, the ribosome must be able to gain 
access to the start codon. The major start codon is AUG, although the minor start codons 
GUG, AUC, and UUG are sometimes used, typically in prokaryotes (53, 65). In 
prokaryotes, the small ribosomal subunit must be able to bind to the RBS, while in 
eukaryotes the small ribosomal subunit must be able to progress in a 3' direction from the 5' 
end of the mRNA until it encounters the start codon or must be able to bind to the IRES. A 
variety of naturally occurring regulatory systems control translation by interfering with these 
processes (e.g., 14, 17, 31, 43). The inventors have recognized that mechanisms similar to 
those involved in naturally occurring regulatory processes, e.g., formation and disruption of 
RNA secondary structures, may be employed to afford control over gene expression. In 
particular, the inventors have designed nucleic acid elements that can be inserted into an 
RNA transcript (e.g., via insertion into a template for synthesis of the RNA transcript), so 
that the resulting RNA molecule assumes a hairpin (stem/loop) secondary structure that 
prevents access to the appropriate portion of the transcript by the small ribosomal subunit. 
[0071] For purposes of illustration, a riboregulator system for use in prokaryotic cells 
will first be described. Differences for eukaryotic systems are described below. It will be 
assumed herein that the start codon is AUG, but it is to be understood that the invention can 
be modified to operate in an essentially identical manner with alternate start codons such as 
GUG, UUG, or AUC, simply by replacing AUG by GUG, UUG, or AUC (or, in DNA, 
replacing ATG by GTG, TTG, or ATC) and, in those embodiments of the invention in 
which the start codon forms part of the crRNA stem, by changing the sequences of 
complementary nucleic acids appropriately. 
[0072] A. C/s-repressive Sequence 

[0073] As shown in Figure IB, in the artificial riboregulator, a small sequence, referred 
to as the cw-repressive sequence (cr), complementary to the RBS, is cloned downstream of 
the promoter (P cr ) and upstream of the RBS. The ^-repressive sequence may, but need not, 
replace part or all of the endogenous sequence of a 5'UTR. Thus addition of the cis- 
repressive sequence may occur by insertion and/or substitution. Addition of the cis- 
repressive sequence therefore may, but need not, alter the length of the 5' UTR. In general, 
the as-repressive sequence may be located at the 5' end of the UTR, or the 5' portion of the 
UTR upstream of the czs-repressive sequence can be of any length, e.g., at least 2 
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nucleotides, at least 4 nucleotides, at least 5 nucleotides, at least 10 nucleotides, at least 25 
nucleotides, at least 50 nucleotides, etc. In addition, there may be one or more ORFs 
upstream of the as-repressive sequence._Following transcription, a hairpin is formed within 
the 5'UTR of the mRNA, which blocks ribosome docking and translation (cw-repression), 
as shown in Figure 1. An RNA molecule comprising a cis-repressive sequence and further 
comprising various additional elements discussed below is referred to as a cis-repressive 
RNA (crRNA). 

[0074] According to an additional aspect of the invention described in more detail 
below, a second promoter, P /fl , expresses a small, non-coding RNA (fran^-activating RNA, 
taRNA) that targets the crRNA with high specificity. The taRNA and crRNA undergo a 
linear-loop interaction that exposes the obstructed RBS and permits activation of expression 
by allowing translation to occur. Figures 2 and 3 show additional crRNA and taRNA 
structures that were constructed. A comparison of the repressive and activating abilities of 
these structures allowed the inventors to refine the basic structure of the crRNA and taRNA 
to optimize their activities. 

[0075] As shown in Figures 1 and 3C (left portion, labeled crRNA) the presence of an 
engineered cw-repressive sequence (cr) upstream of the start codon in an mRNA results in 
formation of a double-stranded stem-loop (hairpin) structure that prevents the ribosome from 
gaining access to the appropriate location on the mRNA from which to initiate translation 
from the downstream start codon. As described in Example 1, the presence of a cis- 
repressive sequence dramatically reduces translation relative to that which occurs when a 
sequence that does not form such a hairpin is positioned upstream of the start codon in an 
mRNA. Example 1 describes construction of crRNA elements comprising cw-repressive 
sequences and their insertion into a DNA construct under control of (i.e., in operable 
association with) an inducible promoter and upstream of an open reading frame that encodes 
a reporter molecule (green fluorescent protein). Measurement of the activity of the reporter 
(fluorescence) provides a measure of the translation of the mRNA. 

[0076] Table 1 presents data showing that insertion of cw-repressive sequence present in 
crRNA structures crRL, crR7, crRlO, and crR12 repressed translation by > 96% at 
intermediate levels of transcription of the mRNA comprising the sequence and by > 97% at 
high transcription levels. It is noted that this level of post-transcriptional repression exceeds 
that achieved heretofore using antisense RNA provided in trans. Thus in certain preferred 
embodiments of the invention translation is repressed by at least 70%, at least 80%, at least 
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90%, or at least 95%. (Note that in Table 1 the % complementarity was calculated by 
computing the total number of matches between the nucleotides in the ris-repressive 
sequence and the corresponding sequences (i.e., the total number of matches in the stem) 
divided by the total length of the stem.) In the calculations presented in Table 1, 
background autofluorescence was not subtracted from the values obtained in the repressed 
and non-repressed states. Subtracting this background results in a more accurate 
computation of the actual degree of repression. When background autofluorescence was 
subtracted, crRNA structures crRL, crR7, crRlO, and crR12 repressed translation by > 98% 
at intermediate or high levels of transcription of the mRNA comprising the sequence. 
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[0077] Table 1 : Percent (%) Sequence Complementarity of Cis Element to the RBS. 
Predicted AGmfold (35, 46) obtained from the MFOLD server. Concentrations of mRNA 
obtained from competitive PCR coupled with MALDI-TOF mass spectrometry are 
internally normalized to 16S rKNA levels within each sample. FL1 values represent 
measured GFP expression levels (arbitrary units) obtained by flow cytometry. RNA and 
FL1 normalized valuees represent each sample normalized to the control (crRNA/C), which 
lacks the cis sequence. Fold induction (+aTc/-aTc) depicts the change in RNA and FL1 
levels between high and low transcription rates within each column. 





Control 


crRL 


crR7 


crRlO 


crR12 


% RBS Sequence 




100 


89 


84 


84 


Complementarity 












AGa/foid (kcal/mol) 


-4.6, -4.8 


-27.6 


-23.7 


-16 


-15.6 


-AtC: mRNA 1 


0.364 ± .077 


0.135 ±.014 


0.154 ± .022 


0.152 ±.022 


0.144 ±.033 


RNA normalized 


1 


.37 


.42 


.42 


.40 


FL1 


113.10±15.8 


2.55 ± .02 


3.75 ±.19 


3.41 ±.03 


2.91 ± .05 


FL1 normalized 


1 


0.022 


0.033 


0.030 


0.026 


+ aTc: mRNA 1 


1.53 ±.176 


0.611 ±.113 


0.629 ± .043 


0.628 ±.096 


0.540 ±.098 


RNA normalized 


1. 


40 


.40 


.41 


35 


FL1 


640.5 ± 25 


4.06 ±.19 


13.61 ±1.12 


10.05 ±.12 


6.55 ±.14 


FL1 normalized 


1 


0.006 


0.021 


0.016 


0.010 


+aTc RNA 


4.2 


4.5 


4.1 


4.1 


3.8 


-aTc 












FL1 


5.7 


1.6 


3.6 


2.9 


2.2 



[0078] In certain preferred embodiments of the invention the hairpin stem formed by 
base pairing between the cw-repressive sequence and sequences between the 3* end of the 
ris-repressive sequence and the 5* end of the ORF is at least 4 nucleotides in length, e.g., 
between 4 and approximately 100 nucleotides in length. In certain embodiments of the 
invention the stem is between approximately 6 and 50 nucleotides in length. In certain 
embodiments of the invention the stem is between approximately 10 and 30 nucleotides in 
length, e.g., 15-25 nucleotides in length. In certain preferred particular embodiments of 
the invention the stem is 18 - 20 nucleotides, or 19 nucleotides in length. In general, shorter 
stems result in decreased repression of translation (leakiness), particularly when the stem 
includes one or more mismatches, bulges, or inner loops as is the case in certain preferred 
embodiments of the invention (see below). Thus in general increased repression may be 
achieved by using a longer stem length. However, in order to achieve efficient reversibility 
of the repression by a fra?w-activating RNA, it may be preferable to avoid extremely long 
stems. In addition, longer stems (in the absence of mismatches, bulges, or inner loops, may 
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activate RNAse HI (in prokaryotes) or the interferon response (in mammals) or similar 
responses in other eukaryotes such as plants, leading to undesired degradation of the 
transcript. Furthermore, for certain applications it may be desirable to utilize a cis- 
repressive sequence that offers less than the maximum obtainable degree of repression. For 
example, to determine gene dosage effects it may be preferable to achieve a "knock-down" 
rather than a "knock-ouf ' of gene expression. It is noted that in certain embodiments of the 
invention the hairpin stem formed by base pairing between the cw-repressive sequence and 
sequences between the 3' end of the ^-repressive sequence and the 5' end of the ORF may 
also include a portion of the 5* end of the ORF. In other words, the sequence at the 5' end 
of the cz>repressive sequence, or the sequence upstream of the cw-repressive sequence may 
be complementary or substantially complementary to a portion of the downstream ORF. 
[0079] In prokaryotes the hairpin stem preferably encompasses part or, more preferably 
all, of the ribosome binding site. Thus the sequence of the ris-repressive sequence is 
complementary, or, preferably, substantially complementary to the RBS sequence. In 
certain embodiments of the invention the as-repressive sequence is at least 66% 
complementary to the RBS. In other embodiments of the invention the c/s-repressive 
sequence is at least 75%, at least 80%, at least 85%, at least 90%, or at least 95% 
complementary to the RBS. In certain embodiments of the invention the ^-repressive 
sequence and the RBS display between 80% and 90% complementarity. While not wishing 
to be bound by any theory, it is likely that the presence of one or more mismatches, bulges, 
or inner loops in the duplex formed by the czs-repressive sequence and the RBS decreases 
the stability of the duplex, which increases the likelihood that the duplex region will undergo 
a conformational change in the presence of a cognate taRNA (see below) so that 
derepression of translation can occur. 

[0080] In eukaryotes the hairpin may be located anywhere within the 5' UTR upstream 
of the start codon (or, in the case of an mRNA that includes an IRES, anywhere between the 
IRES and the start codon), or may include a small portion of the 5' region of the ORF. In 
eukaryotes the most 3' nucleotide in the hairpin stem is preferably located within 100 
nucleotides of the start codon, more preferably within 50 nucleotides of the start codon, 
more preferably within 20 nucleotides of the start codon. In certain embodiments of the 
invention the hairpin stem encompasses part or all of a Kozak consensus sequence. 
[0081] As mentioned above, in certain preferred embodiments of the invention the cis- 
repressive sequence is longer than the RBS or includes only part of the RBS, so that the 
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hairpin stem involves one or more nucleotides between the 3 5 end of the cw-repressive 
sequence and the 5' end of the ORF in addition to, or instead of, the RBS. For example, the 
cis-repressive sequence may be 19 nucleotides in length, and the RBS may be 6-8 
nucleotides in length, as shown in crRNA structures crRL, crR7, crRlO, and crR12, shown 
in Figures 2A and 3, in which the cis-repressive sequences are in italics and the RBS 
sequences are boxed. In this case the hairpin stem includes additional sequences 
downstream of and upstream of the RBS. In certain embodiments of the invention the stem 
may encompass all or part of the start codon. 

[0082] In general, the sequence of the cw-repressive sequence is complementary, or, 
preferably, substantially complementary to a portion of the sequence between the 3' end of 
the cw-repressive sequence and the 5' end of the ORF. In certain embodiments of the 
invention the m-repressive sequence is at least 66% complementary to a portion of the 
sequence between the 3' end of the cw-repressive sequence and the 5' end of the ORF, equal 
in length to the c/s-repressive sequence. In other embodiments of the invention the cis- 
repressive sequence is at least 75%, at least 80%, at least 85%, at least 90%, or at least 95% 
complementary to a portion of the sequence between the 3* end of the m-repressive 
sequence and the 5'end of the ORF equal in length to the cwr-repressive sequence. In certain 
embodiments of the invention the cw-repressive sequence and a portion of the sequence 
between the 3' end of the cw-repressive sequence and the 5'end of the ORF display between 
80% and 90% complementarity. While not wishing to be bound by any theory, it is likely 
that the presence of one or more mismatches, bulges, or inner loops in the duplex formed by 
the cw-repressive sequence and the portion of sequence between the 3' end of the cis- 
repressive sequence and the 5' end of the ORF increases the likelihood that the duplex 
region will undergo a conformational change in the presence of a cognate taRNA (see 
below) so that derepression of translation can occur. 

[0083] The degree of complementarity may also be considered in terms of the ratio of 
the number of nucleotides in complementary nucleotide pairs to the sum of the number of 
nucleotides that are present in mismatches, bulges, or inner loops. According to this 
approach, in certain embodiments of the invention a desirable ratio is between 4:1 and 8:1, 
or between 5:1 and 7:1, or approximately 6:1. 

[0084] In addition to the absolute degree of complementarity between the as-repressive 
sequence and the RBS and/or the absolute degree of complementarity between the cis- 
repressive sequence and a portion of the sequence between the 3' end of the cw-repressive 
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sequence and the ORF, the nature and location of the non-complementary regions are 
significant. In general, the non-complementary portions of the stem may be mismatches, 
bulges, and/or inner loops. In preferred embodiments of the invention one or more 
mismatches, bulges, or inner loops exist within the stem formed by the cw-repressive 
sequence and a portion of the sequence between the 3' end of the m-repressive sequence 
and the ORF. In certain embodiments of the invention 2, 3, 4, or 5 mismatches, bulges, or 
inner loops exist in this region. In general, it is preferred that a bulge comprises between 1 
and 4 nucleotides, e.g., 1, 2, 3, or 4 nucleotides. In certain embodiments of the invention a 
bulge comprises 1 unpaired nucleotide. In general it is preferred that an inner loop 
comprises 5 or fewer nucleotides, e.g., 1, 2, 3, 4, or 5 nucleotides in each strand of the stem. 
In certain embodiments of the invention an inner loop comprises 2 nucleotides in each 
strand of the stem. 

[0085] Preferably the areas of non-complementarity are dispersed at various locations 

within the loop. By "dispersed" is meant that at least one complementary base pair exists 

i 

between any two areas of non-complementarity. In certain embodiments of the invention at 
least 2, at least 3, at least 4, at least 5, at least 6, at least 7, or at least 8 base pairs separate 2 
or more areas of non-complementarity. It may also be desirable to have at least 2 or 3 
nucleotide pairs between the areas of non-complementarity and the last base pair in the stem. 
For example, Fig. 2A and Fig. 3 illustrate certain preferred configurations for the areas of 
non-complementarity in the stems present in crRlO and crR12. Both stems contain 3 
dispersed mismatches. The mismatches are separated from each other by at least 3 
nucleotide pairs and the outer 2 mismatches are each positioned at least 3 nucleotide pairs 
away from the respective ends of the stem. 

[0086] It will be evident to one of ordinary skill in the art that a variety of alternate 
configurations are possible without departing from the guidelines described above. In 
general, the key consideration is the desirability of introducing one or more areas of non- 
complementarity so as to confer partial instability on the stem-loop structure so that 
conformational change can occur in the presence of the cognate taRNA. In this regard it is 
noted that for purposes of the present invention all base pairings other than the cognate base 
pairings (AT, AU, GC) are considered mismatches. However, allowable pairings such as 
GU (wobble base pairs) will confer less instability than pairings such as UU, GG, etc. 
In general, the degree of partial instability is reflected in the change in free energy 
associated with folding, which can be calculated using a variety of computer programs 
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known in the art. For example, the inventors calculated AGmfold using the MFOLD 
program as described in Example 1. 

[0087] The ability of any particular sequence to function as an effective repressor of 
translation may readily be tested by inserting it upstream of an RBS within a 5' UTR of a 
transcript of choice (e.g., one that encodes a reporter molecule) and measuring the resulting 
translation. Example 1 describes such measurements, in which various cw-repressive 
sequences were located within a larger RNA molecule (the crRNA) that can be conveniently 
inserted upstream of any ORF of choice. 

[0088] In Example 1, GFP was used as a reporter. However, any of a wide variety of 
different reporters could be used, including fluorescent or chemiluminescent reporters (e.g., 
GFP variants, luciferase, e.g., luciferase derived from the firefly (Photinus pyralis) or the sea 
pansy (Renilla reniformis) and mutants thereof), enzymatic reporters (e.g., p-galactosidase, 
alkaline phosphatase, DHFR, CAT), etc. The eGFPs are a class of proteins that has various 
substitutions (e.g., Thr, Ala, Gly) of the serine at position 65 (Ser65). The blue fluorescent 
proteins (BFP) have a mutation at position 66 (Tyr to His mutation) which alters its emission 
and excitation properties. This Y66H mutation in BFP causes the spectra to be blue-shifted 
compared to the wtGFP. Cyan fluorescent proteins (CFP) have a Y66W (Tyr to Trp) 
mutation with excitation and emission spectra wavelengths between those of BFP and eGFP. 
Sapphire is a mutant with the excitation peak at 495 nM suppressed while still having the 
excitation peak at 395 and the emission peak at 5 1 1 nM. Yellow FP (YFP) mutants have an 
aromatic amino acid (e.g. Phe, Tyr, Trp) at position 203 and have red-shifted emission and 
excitation spectra. 
[0089] B. Loop Sequence 

[0090] The c/s-repressive sequences described above may be positioned upstream of an 
endogenous or synthetic RBS of choice, without changing or replacing any of the sequences 
between the 3' end of the cw-repressive sequence and the 5* end of the ORF. In such a case 
the sequence of the c/s-repressive sequence is selected to achieve the desired degree of 
complementarity in the hairpin stem, while the loop consists of whatever sequence is present 
between the 3' end of the cw-repressive sequence and the 5' end of the sequence with which 
it pairs to form the stem. In general, therefore, the length of the loop depends on the 
positioning of the cw-repressive sequence with respect to downstream complementary 
sequences. In certain preferred embodiments of the invention the length of the loop is 
between 3 and 15 nucleotides inclusive, between 4 and 10 nucleotides inclusive, between 4 
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and 8 nucleotides inclusive, or between 5 and 7 nucleotides, inclusive, e.g., 5, 6, or 7 
nucleotides. 

[0091] In addition, in order to achieve derepression in the presence of the cognate 
taRNA, in certain preferred embodiments of the invention the loop comprises a YUNR 
(pYrimidine-Uracil-Nucleotide-puRine) sequence, where Y stands for a pyrimidine (e.g., U 
or C in RNA, T or C in DNA), U stands for uracil, N stands for any nucleotide, and R stands 
for a purine (e.g., A or G). For example, a suitable YUNR sequence is UUGG. The YUNR 
sequence has been shown to be important for intermolecular RNA complex formation in the 
naturally occurring Rl system (34). While not wishing to be bound by any theory, it is 
likely that this sequence facilitates a linear-loop intermolecular interaction with a cognate 
taRNA that includes a nucleotide sequence complementary to the YUNR motif. The YUNR 
sequence may be located anywhere within the loop. 
[0092] C. C^-repressive RNA Elements 

[0093] The cw-repressive sequence and loop sequence described above may be 
combined to form a single RNA element which, together with additional sequences, can be 
positioned upstream of any ORF (e.g., either inserted into or replacing part of the 5' UTR) in 
order to repress translation. Such composite RNA elements are referred to herein as cis- 
repressive RNA (crRNA). In addition to the cw-repressive sequence and loop, a crRNA 
element comprises a sequence substantially complementary to the cis-repressive sequence. 
In implementations for prokaryotic systems, this sequence typically comprises an RBS. The 
crRNA thus comprises a first stem-forming portion (the cw-repressive sequence) and a 
second stem-forming portion, wherein the two stem-forming portions are complementary or, 
preferably, substantially complementary, and wherein the two stem-forming portions are 
connected by a non-stem-forming portion that forms a loop connecting the 3' end of the first 
stem-forming portion and the 5' end of the second stem-forming portion. The loop sequence 
preferably includes a YUNR motif. Preferred lengths of the two stem-forming portions, and 
preferred degrees of complementarity between the stem-forming portions are as described 
above for the case in which the cw-repressive sequence form a stem with sequences between 
the 3' end of the czs-repressive sequence and the 5* end of the ORF. Here the crRNA 
element provides the loop and some or all of the sequences between the 3' end of the cis- 
repressive sequence and the 5' end of the ORF. 

[0094] The crRNA may further include a start codon, e.g., AUG. The AUG may be 
positioned downstream of (i.e., in a 3' direction from) the 3' end of the second stem-forming 



25 



WO 2004/046321 



PCT/US2003/036506 



portion as shown in Figure 3 A. There may, but need not be, one or more nucleotides 
between the 3' end of the second complementary portion and the AUG. In other 
embodiments of the invention the AUG forms part of the second stem-forming portion and 
thus participates in formation of the stem. 

[0095] In general, preferred crRNA sequences include a spacer region between the 3' 
end of the RBS and the start codon. In prokaryotes, presence of such a spacer contributes to 
a high level of translation (62). For example, as shown in Figures 2 and 3, the sequence 
AAGGUACC is present between the 3' end of the RBS and the AUG. It is noted that this 
sequence contains a restriction site, to facilitate cloning, although this is not required. Other 
restriction sites could also be used. In certain embodiments of the invention the spacer has 
the sequence GTTTTTACC. It has been shown that the sequence 
AGGAGG GTTTTTACC AUG (SEQ ID NO:2) (in which the RBS and start codon are 
underlined) can support a high level of translation in both prokaryotic and eukaryotic 
systems (61). 

[0096] The crRNA may, but need not, include a single-stranded portion upstream of 
(i.e., in the 5' direction from) the first stem-forming portion. In crRlO, for example, the 
single-stranded portion has the sequence GAAUUC. However, in general this portion may 
have any sequence, including the sequence of part or all of the 5' UTR of a gene. In 
general, this sequence may have any length and may represent any portion of an mRNA 
transcript located upstream of the RBS that is sequestered by the crRNA. 
[0097] It will be appreciated that if the template for transcription of a crRNA is present 
within a plasmid or is integrated into the cellular genome, some or all of the crRNA 
elements may be provided by the plasmid or by the endogenous DNA. For example, DNA 
that provides a template for transcription of the first stem-forming portion and the loop may 
be inserted into genomic DNA upstream of an endogenous RBS. In this case some or all of 
the second complementary portion and the AUG will be provided by the genomic DNA. 
[0098] An exemplary structure of a crRNA of the invention (crR12) is depicted in 
Figure 3B. The crRNA comprises first and second substantially complementary stem- 
forming portions that hybridize to form a double stranded stem and an intervening non-stem- 
forming portion that forms a single-stranded loop joining the 3' end of the first stem- 
forming portion to the 5' end of the second stem-forming portion. The first stem-forming 
portion comprises a cw-repressive sequence, such as those described above. The stem 
includes 3 dispersed bulges, one of which occurs in the portion of the stem that includes the 
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RBS. The crRNA element further includes an AUG and a short single-stranded region 
upstream of the AUG between the AUG and one end of the stem. In addition, the crRNA 
element includes a single-stranded portion at its 5' end before the beginning of the stem. 
[0099] The invention therefore provides a nucleic acid molecule comprising (i) a first 
stem-forming portion comprising a cw-repressive sequence; (ii) a second stem-forming 
portion, wherein the two stem-forming portions are complementary or, preferably, 
substantially complementary, and (iii) a non-stem-forming portion that forms a loop 
connecting the 3' end of the first stem-forming portion and the 5' end of the second stem- 
forming portion. The loop sequence preferably includes a YUNR motif. In certain 
preferred embodiments of the invention a stem formed by the two stem- forming portions is 
between approximately 12 and 26 nucleotides in length, e.g., approximately 19 nucleotides 
in length. In certain preferred embodiments of the invention the complementarity of the 
stem-forming portions is between 75% and 95%, e.g., approximately 85%. In certain 
preferred embodiments of the invention the stem comprises at least 2 dispersed areas of non- 
complementarity, e.g., 3 areas of non-complementarity, which may be bulges, mismatches, 
or inner loops. In certain embodiments of the invention the second stem-forming portion 
comprises an RBS. In certain embodiments of the invention the second stem-forming 
portion comprises a Kozak consensus sequence. 

[00100] It is noted that in certain embodiments of the invention the crRNA forms only a 
single loop in its translation-repressing configuration, unlike various naturally occurring 
regulatory systems in which multiple loops are formed. In addition, in certain embodiments 
of the invention the crRNA represses translation in the absence of a ligand. 
[00101] V. Design of 7raK,s-activating RNA Elements 

[00102] As mentioned above, in certain preferred embodiments of the invention the cis- 
repressive sequences and crRNA elements operate in conjunction with additional RNA 
elements, referred to as Jra/w-activating RNA elements (taRNAs) that derepress translation 
that is repressed by the cognate cw-repressive sequence or crRNA element. As shown in 
Figures 1 and 3C, prior to interaction with the cognate m-repressive sequence or crRNA 
element, the taRNA comprises first and second stem-forming portions and a non-stem- 
forming portion, wherein the non-stem-forming portion connects the 3' end of the first stem- 
forming portion and the 5' end of the second stem-forming portion to form a loop, and 
wherein a portion of the taRNA is complementary or, preferably, substantially 
complementary, to a portion of a cognate as-repressive sequence or cognate crRNA. 
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Preferably the first stem-forming portion comprises a portion that is complementary or 
substantially complementary to an RBS. 

[00103] In preferred embodiments of the invention the 5' portion of the taRNA (i.e., the 
portion 5' of the most 5' nucleotide in the first stem-forming portion) comprises a sequence 
that is complementary to the sequence in the loop of a cognate crRNA. In particular, if the 
crRNA loop comprises a YUNR sequence, then preferably the taRNA 5 'region comprises a 
YNAR sequence. The length of the 5 'portion of the taRNA may vary. However, in certain 
embodiments of the invention the length of this portion is less than 100 nucleotides, less 
than 50 nucleotides, less than 25 nucleotides, or less than 10 nucleotides. In certain 
embodiments of the invention the length of the 5' portion of the taRNA is between 5 and 10 
nucleotides. While not wishing to be bound by any theory, it is possible that a longer 5' 
portion may interfere with formation or stability of the crRNA:taRNA duplex or may 
impede access by the ribosome to the region upstream of the ORF, e.g., the RBS (see below 
for discussion of this duplex). 

[00104] In preferred embodiments of the invention the first and second stem-forming 
portions of the taRNA form a stem that is between 6 and 100 nucleotides in length, 
preferably between 10 and 50 nucleotides in length, e.g., between 10 and 40, between 15 
and 30 nucleotides in length, etc. For example, Figure 3 A shows a taRNA structure (taR12) 
in which the stem is 26 nucleotides in length. Preferably if the stem is greater than 
approximately 20 nucleotides in length it includes one or more mismatches, bulges, or inner 
loops. For example, taR12 (see Figure 3 A) includes a two nucleotide inner loop and a 
bulge. In general, the degree of complementarity between the two stem-forming portions 
may be, e.g., between 75% and 95%, approximately 85%, etc. While not wishing to be 
bound by any theory, it is likely that including areas of non-compementarity reduces 
degradation of the RNA as discussed elsewhere herein. In addition, partial instability may 
be important to facilitate the linear-loop interaction between the taRNA and a cognate 
crRNA. 

[00105] When present within a system (e.g., inside a cell) in which translation of an ORF 
is repressed by a cognate ^-repressive sequence, e.g., a cognate crRNA, the taRNA causes 
derepression, allowing translation to occur. As shown in Figure 3C, while not wishing to be 
bound by any theory, it is believed that the taRNA interacts with an RNA comprising a 
cognate cw-repressive sequence (preferably a cognate crRNA) to form a linear-loop 
complex. The linear-loop complex then undergoes further conformational change so that a 
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duplex structure forms between a portion of the taRNA and a complementary or, preferably, 
substantially complementary portion of the c£s -repressive sequence or crRNA The 
conformational change disrupts the stem-loop in which the cw-repressive sequence 
participated, thereby making the region upstream of the ORF (e.g., the region comprising an 
RBS or, in eukaryotes, a region comprising a Kozak consensus sequence) accessible to the 
ribosome. In the case of a prokaryotic system, the ribosome can now gain entry to the RBS 
and translation can proceed. A stem-loop structure remains in the original taRNA. In 
preferred embodiments of the invention in eukaryotic systems, this stem is small enough that 
it does not prevent a scanning ribosome from initiating translation of the downstream ORF. 
For example, in certain embodiments of the invention the stem that exists in the taRNA after 
interaction is 50 nucleotides or less, 40 nucleotides or less, 30 nucleotides or less, 20 
nucleotides or less, or 10 nucleotides or less. 

[00106] Figure 3H shows the structure formed by cognate RNA molecules crR12 and 
taR12 in further detail, indicating base pairing interactions. As shown therein, a duplex 
forms between a portion of the taRNA and a portion of the crRNA that includes some, or 
preferably all of the cis-repressive sequence. Formation of this structure exposes the 
sequence 5' of the start codon (including the RBS), thus derepressing (activating) 
translation. In certain embodiments of the invention the duplex formed between portions of 
the taRNA and crRNA is between 4 and 100 nucleotides in length, or between 6 and 80, 
inclusive. In certain embodiments of the invention the duplex formed between portions of 
the taRNA and crRNA is between 10 and 50 nucleotides in length, inclusive. In certain 
embodiments of the invention the duplex formed between portions of the taRNA and crRNA 
is between 20 and 30 nucleotides in length, inclusive. In cases where the duplex is more than 
approximately 20 nucleotides in length, preferably it includes one or more areas of non- 
complemenarity, e.g., in order to reduce degradation by RNAse molecules present within a 
cell. For example, the duplex in Figure 3H is 26 nucleotides in length and includes two 
dispersed mismatches. It will be appreciated that the taRNA can operate in conjunction 
with a c^-repressive sequence that represses translation as described above, wherein the cis- 
repressive sequence is not necessarily within a crRNA element. 

[00107] It will be appreciated that in accordance with the invention any particular taRNA 
will operate to derepress translation only when translation is repressed by a suitable cognate 
crRNA, as opposed to when translation is repressed by a noncognate crRNA. The inventors 
showed that translation that was repressed by crRlO was activated in the presence of taRlO 
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(the cognate taRNA for crRlO) by approximately 5-fold relative to the level of translation in 
the absense of taRlO. Even more strikingly, translation that was repressed by crR12 was 
activated in the presence of taR12 (the cognate taRNA for crR12) by 10-fold relative to the 
level of translation in the absense of taR12. When background autofluorescence was 
subtracted as described above for crRNA calculations, translation that was repressed by 
crRlO was activated in the presence of taRlO by 8-fold relative to the level in the absence of 
taRlO, and translation that was repressed by crR12 was activated in the presence of taR12 
by 19- fold relative to the level of translation in the absence of taR12. Noncognate taRNAs 
had no effect on translation. Thus in preferred embodiments of the invention translation 
repressed by a cw-repressive sequence or crRNA is activated by at least 5-fold by the 
cognate taRNA. In certain embodiments of the invention translation repressed by a cis- 
repressive sequence or crRNA is activated by at least 10-fold by the cognate taRNA. In 
certain embodiments of the invention translation repressed by a cis-repressive sequence or 
crRNA is activated by at least 19-fold by the cognate taRNA. 

[00108] In order to demonstrate the specificity of the crRNA:taRNA interaction, the 
inventors measured the equilibrium association constants (Ka) between various 
crRNA:taRNA pairs. As shown in Table 2, the measured Ka values for non-cognate pairs 
were approximately an order of magnitude or more lower than the values for cognate pairs. 
Thus in certain preferred embodiments of the invention the equilibrium association constant 
between cognate crRNA: taRNA pairs is at least 0.5 x 10 7 kcal/mol. In other embodiments 

7 7 

of the invention the equilibrium association constant is between 0.5 x 10 and 3.0 x 10 
kcal/mol, inclusive. In other embodiments of the invention the equilibrium association 
constant is between 0.5 x 10 7 and 2.0 x 10 7 kcal/mol, inclusive. In other embodiments of the 
invention the equilibrium association constant is between 0.8 x 10 7 and 1.5 x 10 7 kcal/mol 
In other embodiments of the invention the equilibrium association constant is approximately 
1.0 x 10 7 , 1.1 x 10 7 , or 1.2 x 10 7 kcal/mol. 
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[00109] Table 2: Specificity of taRNA-crRNA Interactions. K A : Equilibrium association 
constant (Mol -1 ) measured by in vitro experiments. AFL1 represents the fold change in 
fluorescence (arbitrary fluorescence units) in the presence of taRNA (+arabinose/- 
arabinose). 
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[001 10] In certain preferred embodiments of the invention the crRNA and taRNA 
sequences each have only a single predicted secondary structure rather than multiple 
predicted secondary structures. A number of computer programs are available to predict 
secondary structure (e.g., Mfold™, RNAfold™, etc.) One of ordinary skill in the art will be 
able to select and apply a suitable program for RNA structure prediction when designing 
crRNA and taRNA molecules in accordance with the principles described herein. 
[00111] VI DNA Templates, Constructs, Plasmids. Cells, and Kits 
[00112] Although the invention was described above primarily in reference to RNA, the 
nucleic acid molecules of the invention can be RNA or DNA. In general, RNA and DNA 
molecules can be produced using in vifro systems, within cells, or by chemical synthesis 
using methods well known in the art. It will be appreciated that insertion of c/s-repressive 
sequences, crRNA elements, etc., upstream of an open reading frame will typically be 
accomplished by modifying a DNA template for transcription of the ORF. The invention 
therefore provides DNA templates for transcription of a crRNA or taRNA. The invention 
also provides DNA constructs and plasmids comprising such DNA templates. In certain 
embodiments of the invention the template for transcription of a crRNA is operably 
associated with a promoter. In particular, the invention provides a DNA construct 
comprising (i) a template for transcription of a crRNA; and (ii) a promoter located upstream 
of the template. In certain embodiments of the invention a construct or plasmid of the 
invention includes a restriction site downstream of the 3' end of the portion of the construct 
that serves as a template for the crRNA, to allow insertion of an ORF of choice. The 
construct may include part or all of a polylinker or multiple cloning site downstream of the 
portion that serves as a template for the crRNA. The construct may also include an ORF 
downstream of the crRNA oortion. The invention nrovides a DNA construct comprising (i) 
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a template for transcription of a taRNA; and (ii) a promoter located upstream of the 
template. The invention further provides a DNA construct comprising: (i) a template for 
transcription of a crRNA; (ii) a promoter located upstream of the template for transcription 
of the crRNA; (iii) a template for transcription of a taRNA; and (iv) a promoter located 
upstream of the template for transcription of the taRNA. The promoters may be the same or 
different. 

[00113] The DNA constructs may be incorporated into plasmids, e.g., plasmids capable 
of replicating in bacteria. In certain embodiments of the invention the plasmid is a high 
copy number plasmid (e.g., a pUC-based or pBR322-based plasmid) while in other 
embodiments of the invention the plasmid is a low copy number plasmid (36). The plasmid 
may include any of a variety of origins of replication, which may provide different copy 
numbers. For example, any of the following may be used (copy numbers are listed in 
parenthesis): ColEl (50-70 (high)), pl5A (20-30 (medium)), pSClOl (10-12 (low)), 
pSOOl* (< 4 (lowest). It may be desirable to use plasmids with different copy numbers for 
transcription of mRNA to be post-transcriptionally regulated and/or for transcription of 
taRNA elements to achieve an additional level of control over gene expression. In addition, 
in certain embodiments of the invention a tunable copy number plasmid is employed (72). 
Figures 5 A and 5B show plasmids that provide templates for transcription of a crRNA and a 
taRNA molecule respectively. Figure 5C shows a plasmid that provides templates for 
transcription of both a crRNA and a taRNA molecule. 

[00114] The invention further provides viruses and cells comprising the nucleic acid 
molecules, DNA constructs, and plasmids described above. In various embodiments of the 
invention the cell is a prokaryotic cell. In various embodiments of the invention the cell is a 
eukaryotic cell (e.g., a fungal cell, mammalian cell, insect cell, plant cell, etc.). The nucleic 
acid molecules or DNA constructs may be integrated into a viral genome using recombinant 
DNA technology, and infectious virus particles comprising the nucleic acid molecules 
and/or templates for their transcription can be produced. The nucleic acid molecules, DNA 
constructs, plasmids, or viruses may be introduced into cells using any of a variety of 
methods well known in the art, e.g., electroporation, calcium-phosphate mediated 
transfection, viral infection, etc. (See, e.g., 47). As discussed further below, the DNA 
constructs can be integrated into the genome of a cell. In general, the cells of the invention 
may be present in culture or in an organism. If present within a human being, the cells are 
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not part of the human being, thereby avoiding any interpretation of the claims of the 
invention that may be construed as claiming a human being or portion thereof. 
[00115] The invention further provides transgenic plants and non-human transgenic 
animals comprising the nucleic acid molecules, DNA constructs, and/or plasmids of the 
invention. Methods for generating such transgenic organisms are well known in the art. 
[00116] The invention further provides a variety of kits for implementation of the 
riboregulator system. For example, the invention provides a kit comprising two plasmids, 
wherein the first plasmid comprises (i) a template for transcription of a cw-repressive RNA 
element; and (ii) a promoter located upstream of the template for transcription of the cis- 
repressive RNA element, and wherein the second plasmid comprises (i) a template for 
transcription of a cognate /rans-activating RNA element; and (ii) a promoter located 
upstream of the template for transcription of the trans- activating RNA element. The 
promoters may be the same or, preferably, different. One or more of the promoters may be 
inducible. The plasmids may have the same or different copy numbers. The invention 
further provides a kit comprising a single plasmid that comprises a template for transcription 
of a cw-repressive RNA element and a promoter located upstream of the template for 
transcription of the m-repressive RNA element and further comprises a template for 
transcription of a cognate fra^s-activating RNA element and a promoter located upstream of 
the template for transcription of the cognate /raws-activating RNA element In certain 
embodiments of the invention the plasmids comprise one or more restriction sites 
downstream of the template for transcription of the czs-repressive RNA element for insertion 
of an open reading frame of choice. The kits may further include one or more of the 
following components: (i) one or more inducers; (ii) host cells (e.g, prokaryotic or 
eukaryotic host cells); (iii) one or more buffers; (iv) an enzyme, e.g., a restriction enzyme; 
(v) DNA isolation and/or purification reagents; (vi) a control plasmid lacking a crRNA or 
taRNA sequence; (vii) a control plasmid containing a crRNA or taRNA sequence or both; 
(viii) sequencing primers; (ix) instructions for use. The control plasmids may comprise a 
reporter sequence. 

[00117] The invention further provides oligonucleotides comprising a crRNA sequence 
and oligonucleotides comprising a taRNA sequence. In addition, the invention provides sets 
of two or more oligonucleotides. A first set of oligonucleotides includes two or more 
oligonucleotides whose sequences together comprise a crRNA sequence. The invention also 
provides a second set of oligonucleoties whose sequences together comprise a taRNA 



33 



WO 2004/046321 



PCT/US2003/036506 



sequence. For ease of cloning, it may be preferable to employ two oligonucleotides each of 
which includes a single stem-forming portion, in different cloning steps, rather than a single 
oligonucleotide comprising two stem-forming portions, in order to avoid formation of a stem 
within the oligonucleotide, which may hinder cloning (see Example 1). The 
oligonucleotides may be provided in kits with any of the additional components mentioned 
above. The oligonucleotides may include restriction sites at one or both ends. 
[00118] VII. Components for Riboregulator Systems 

[00119] The sections above described an implementation using two different promoter 
pairs to drive transcription of the crRNA and taRNA and employed a single consensus 
ribosome binding site. This section describes a number of variations suitable for use in 
various embodiments of the invention. However, the invention is not limited to these 
particular embodiments. 
[00120] A. Ribosome binding site 

[00121] The riboregulators described above employed a consensus prokaryotic RBS. 
However, in various embodiments of the invention any of a variety of alternative sequences 
may be used as the RBS. The sequences of a large number of bacterial ribosome binding 
sites have been determined, and the important features of these sequences are known (see 
53, 54, 55 and references therein, which are incorporated by reference herein). Preferred 
RBS sequences for high level translation contain a G-rich region at positions -6 to -1 1 with 
respect to the AUG and typically contain an A at position -3. Exemplary RBS sequences 
for use in the present invention include, but are not limited to, AGAGGAGA (or 
subsequences of this sequence, e.g., subsequences at least 6 nucleotides in length, such as 
AGGAGG. Shorter sequences are also acceptable, e.g., AGGA, AGGGAG, GAGGAG, etc. 
Numerous synthetic ribosome binding sites have been created, and their translation initiation 
acitivy has been tested (53). In various embodiments of the invention any naturally 
occurring RBS may be used in the crRNA and taRNA constructs. Any of the RBS 
sequences provided in (53), or, shorter versions thereof (e.g.., the first 6 nucleotides, the first 
8 nucleotides, or the first 10 nucleotides) may also be used. The activity of any candidate 
sequence to function as an RBS may be tested using any suitable method. For example, 
expression may be measured as described in Example 1, or as described in reference 53, 
e.g., by measuring the activity of a reporter protein encoded by an mRNA that contains the 
candidate RBS appropriately positioned upstream of the AUG. Preferably an RBS sequence 
for use in the invention supports translation at a level of at least 10% of the level at which 
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the consensus RBS supports translation (e.g., as measured by the activity of a reporter 
protein). For example, if the candidate RBS is inserted into the control plasmid described in 
Example 1 in place of the consensus RBS, the measured fluorescence will be at least 10% of 
that measured using the consensus RBS. In certain embodiments of the invention an RBS 
that supports translation at a level of at least 1%, 5%, 10%, 20%, 30%, 40%, 50%, 60%, 
70%, 80%, 90%, or more relative to the level at which the consensus RBS supports 
translation is used. In certain embodiments of the invention an RBS that supports translation 
at higher levels than the consensus RBS is used. If an alternative RBS is selected, the cis- 
repressive sequence and taRNA sequence are modified to be complementary to the 
alternative RBS. 
[00122] B. Promoters 

[00123] A large number of different promoters that operate in prokaryotic cells are 
known and can be used to drive transcription of mRNAs comprising crRNA elements and/or 
transcription of taRNA elements in various embodiments of the invention. As described 
herein, inducible promoters such as the PL(tetO), pBAD, and PL(lacO) promoters are used. 
Other synthetic promoters that may be used include PAllacO-1 and Plac/ara-1. Phage 
promoters such as SP6, T3, or T7 can also be used. Other suitable promoters include, 
without limitation, any of the responsive or consitutive promoters listed in Table 3. In 
general, the level of transcription driven by a "responsive promoter" varies depending on, or 
in response to, environmental conditions or stimuli, or changes in such conditions or stimuli. 
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[00124] Table 3: Responsive and Constitutive Prokaryotic Promoters 



ResDonsive Promoters 


Compound / Condition Sensed 


Panr 


Anaerobicity 


T*cspA 


Cold Shock 


?Eo32 


Heat Shock 


?ibp 


Cytoplasmic Stress 


VoxyR 


Oxidative Stress 


PlexA 


SOS (DNA damage) 


FrecA 


SOS (DNA damage) 


VphoB 


Phosphate Starvation 


¥ada 


DNAAlkylation 


VdmpR 


Aromatic Compounds 


?u 


Toluene-based Compounds 


VbphAl 


Polychlorinated biphenyls (PCBs) 


VmerTPAD 


Mercury 


VpcoE 


Copper 


?pbrA 


Lead 


Ycad 


Cadmium 


r nis 


Histidine 


rscri 


Sucrose 


ralfco 


Middle-chain Alkanes 


riuxi 


N-Acvl homoserine lactones 


srjlgMJy, xjlgAMJy 


Flagellation 


?JlhD 


Flagellation, Motility and Chemotaxis 


?uhpABCT 


Sugar Phosphate Uptake 


Constitutive Promoters 




Pjpc 


Ribosomal Protein Operon 


PL 


From Phage Lambda 


P; 


RrnB Ribosomal RNA operon 


?2 


RmB Ribosomal RNA operon 


?lpp 


Promoter for major outer membrane 
lipid protein gene 



[00125] Any of a wide variety of promoters can be used in eukaryotic cells (e.g., fungal, 
insect, plant, or mammalian cells) to drive transcription of the mRNA containing the cis- 
repressive RNA element and the /rajw-activating RNA element. Suitable promoters include, 
without limitation, constitutive promoters (e.g., actiiu tubulin), inducible promoters, GAL 
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promoters, viral LTR Gong terminal repeat) promoters, CMV promoter (cytomegalovirus), 
RSV promoter (Rous sarcoma virus), S V40 promoter, cauliflower mosaic virus promoter 
(CaMV), Vlambdal promoter, EF1 alpha promoters, cell cycle regulated promoters (e.g, 
cyclin A, B, C, D, E, etc). Suitable inducible promoters include steroid responsive 
promoters, metal-inducible promoters (e.g., metallothionine promoter), the tet system (67, 
71) etc. Non-limiting examples of tissue-specific promoters appropriate for use in 
mammalian cells include lymphoid-specific promoters (see, for example, Calame et al., Adv. 
Immunol. 43:235, 1988) such as promoters of T cell receptors (see, e.g., Winoto et al., 
EMBOJ. 8:729, 1989) and immunoglobulins (see, for example, Banerji et al., Cell 33:729, 
1983; Queen et al., Cell 33:741, 1983), and neuron-specific promoters (eg., the 
neurofilament promoter; Byrne et al., Proc. Natl Acad. Sci. USA 86:5473, 1989). 
Developmentally-regulated promoters may also be used, including, for example, the murine 
hox promoters (Kessel et al., Science 249:374, 1990) and the a-fetoprotein promoter 
(Campes et al., Genes Dev. 3:537, 1989). One of ordinary skill in the art will be able to 
select appropriate promoters depending, e.g., upon the particular cell type in which the cis 
and trans elements of the invention are to be employed. 
[00126] VDI. In Vitro Selection of Additional crRNA:taRNA Cognate Pairs 
[00127] It will be appreciated that using the guidelines described herein one of ordinary 
skill in the art will be able to readily design and generate additional c/s-repressive 
sequences, crRNA elements, and cognate taRNA elements, including elements that exhibit a 
a variety of different levels of repression and derepression. The invention provides a variety 
of different methods for so doing. For example, one of ordinary skill in the art will 
appreciate that by changing one nucleotide in a first stem-forming portion and making a 
compensatory change in the second stem-forming portion (so that the resulting nucleotides 
still form either a complementary or non-complementary pair as in the additional structure), 
the structure and thermodynamic properties of the resulting structure remain largely the 
same. Thus by beginning with known crRNA:taRNA cognate pairs, one can generate a 
family of related cognate pairs by systematically altering pairs of nucleotides. In addition, 
making small changes, e.g., engineering an additional 1 nucleotide bulge, increasing the 
length of the stem-forming portion by one or two nucleotides, etc., will result in 
crRNA:taRNA pairs with similar properties to the parent pair. In making such changes it 
will generally be desirable to retain features such as the presence of dispersed areas of non- 
complementarity, the approximate overall percent complementarity, the approximate 
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equilibrium association constant of the pairs, etc. Thus the invention specifically 
encompasses variants of crRlO, crR12, taRlO, and taR12 that differ from these molecules by 
10 or fewer nucleotides, i.e., molecules that can be derived from crRlO, crR12, taRlO, or 
taR12 by making 12 or fewer (e.g., 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 1 1, or 12) additions, 
substitutions, or deletions of a nucleotide. The ability of any crRNA:taRNA pair to repress 
or activate translation, respectively, maybe readily tested, e.g., as described in Example 1. 
[00128] A further aspect of the invention is a method for generating large numbers of 
additional riboregulator pairs using an in vitro selection process. This method (which may 
be referred to as "directed evolution") can result in generation of a very large number of 
specific riboregulator pairs. According to one embodiment of the inventive method one 
begins with the sequences of a riboregulator pair that has been shown to function to repress 
and derepress translation (e.g., the crR10:taR10 or crR12:taR12 pair). An initial pool of 
randomized molecules is generated based on these sequences (e.g. as described in 22, 23, 68, 
69) in which the nucleotides that participate in the crRNAtaRNA interaction are targeted for 
randomization (e.g, the 26 nucleotides that form a stem when crRlO and taRlO or crR12 and 
taR12 interact). Randomization is typically performed using a PCR step, e.g., employing 
error-prone PCR (78, 79). Thus the starting templates for the in vitro selection process are 
typically DNA constructs that comprise templates for the initial crRNA:taRNA pair. In 
general, any of a variety of other methods may be used to achieve randomization including, 
but not limited to, DNA shuffling (80, 81), cassette mutagenesis (82), degenerate 
oligonucleotide directed mutagenesis (83, 84), sticky feet mutagenesis (85), and random 
mutagenesis by whole plasmid amplification (86). If desired, multiple rounds of 
randomization can be performed. 

[00129] Following randomization, the crRNA and taRNA randomized sequences are 
amplified (e.g., using PCR) to incorporate a promoter for in vitro transcription (e.g., a T7 or 
SP6 promoter) at the 5' end. An in vitro transcription reaction is then performed using the 
products, in order to synthesize separate pools of crRNA and taRNA transcripts for use in 
subsequent selection steps. 

[00130] Portions of the crRNA transcript pool are dispersed into individual vessels, e.g., 
multiwell plates. Portions of the taRNA transcript pool are also added to the vessels so that 
each vessel contains a plurality of different crRNA transcripts and a plurality of different 
taRNA transcripts. The taRNAs may be added at elevated concentrations relative to that 
which would typically be achieved within cells, e.g., concentrations optimized for cognate 
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pair binding vs binding of noncognate pairs (see Example 3). Pairing between cognate 
crRNA and taRNA elements is allowed to occur. A labeled reverse transcription primer 
probe (e.g. a Cy5 labeled probe as described in Example 3) is added, and RT-PCR is 
performed. The RT-PCR products are then run on a gel. RT-PCR generates two main 
populations of detectable RNA species: (i) the crRNA molecule alone and (ii) the 
crRNA:taRNA complex. Pairs that show both RNA species are selected. 
[00131] Standard sequencing reactions are performed for each selected pair. The selected 
crRNA and taRNA sequences are then analyzed using any available algorithm for prediction 
of secondary structure (e.g., MFOLD, RNAFOLD). If desired, structures may be examined 
to determine whether they meet certain of the guidelines for effective crRNA: taRNA 
elements described above. Selected crRNA and taRNA elements are cloned into appropriate 
vectors (e.g., those presented in Figure 5) in place of the crRNA and taRNA elements. Gene 
expression experiments are then performed to determine whether the crRNA elements 
effectively repress translation of a downstream ORF and whether repression is relieved by 
the cognate taRNA. It will be appreciated that a number of variations on the above can be 
made. For example, alternative methods for assessing the interaction between candidate 
crRNA:taRNA pairs can be used (see Example 3, which mentions alternate methods for 
measuring association constants). In addition, it will be appreciated that it is not necessary 
to randomize both the crRNA and taRNA sequences. For example, it may be desirable to 
use a single crRNA sequence and to randomize only the taRNA sequence, in order to 
generate a set of taRNA molecules that display a range of abilities to activate translation that 
is repressed by the crRNA molecule. Conversely, it may be desirable to use a single taRNA 
sequence and to randomize only the crRNA sequence in order to generate a set of crRNA 
molecules that can be derepressed to different extents by the same taRNA molecule. 
[00132] DC. Increasing Riboregulator Flexibility 
[00133] A. Use of Responsive Promoters 

[00134] A variety of approaches may be employed to enhance the flexibility of the 
riboregulator systems of the present invention. For example, by placing transcription of the 
taRNA element under control of a responsive promoter, such as an endogenous cellular 
promoter that is responsive to an environmental or developmental stimulus (e.g., the 
presence of a small molecule, metabolite, nutrient, hormone, cell density signal, etc.), 
activation of translation by the taRNA in turn becomes responsive to that stimulus. By 
incorporating a single crRNA element into an mRNA upstream of the open reading frame 
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and by driving transcription of the cognate taRNA element from a plurality of different 
promoters, each responsive to a different environmental or developmental stimulus, 
translation of the mRNA is in turn made responsive to each of these stimuli. This type of 
control is referred to as "many to one" control since many stimuli affect translation of one 
mRNA. Conversely, a single crRNA element may be positioned upstream of the ORF in a 
plurality of different mRNAs. Transcription of the cognate taRNA causes activation of 
translation of the plurality of mRNAs. This type of control is referred to as "one to many" 
control. 

[00135] By combining these two approaches, "many to many" control can be achieved 
using only a single cognate crRNA:taRNA pair. For example, a single crRNA element may 
be positioned upstream of the ORF in a plurality of different mRNAs, and transcription of 
the cognate taRNA may be placed under control of a plurality of different promoters, each 
responsive to an environmental or developmental stimulus. Occurrence of any of the stimuli 
activates transcription of the taRNA, which then activates translation of all of the ORFs that 
contain the crRNA element upstream of the ORF. Thus any of a variety of inputs can result 
in a single, coordinated output involving translation of multiple different ORFs. Yet further 
flexibility can be achieved by using a plurality of different cognate crRNA:taRNA pairs. 
Thus it is possible to extensively modify existing genetic networks, to integrate new 
components into such networks, or to create entirely artificial genetic networks of 
considerable complexity using the riboregulator systems described herein. 
[00136] B. Translational Control Using Single Plasmid, Multiple Plasmid.or 
Chromosomallv Integrated CrRNA and TaRNA Elements 

[00137] As mentioned above, the invention provides plasmids comprising templates for 
transcription of crRNA and taRNA elements. In general, crRNA and taRNA elements may 
be introduced into a cell on separate plasmids, or a single plasmid containing one or more 
crRNA and/or taRNA elements can be introduced into a cell. Thus a plasmid may contain 
one or more crRNA elements, one or more taRNA elements (which may be cognate to the 
same crRNA element or to different crRNA elements), or both crRNA and taRNA elements. 
Generally each crRNA and/or taRNA element is operably linked to a promoter. The same 
promoter may drive transcription of multiple elements, or different promoters may be used 
for different elements. In general, it will be desirable to employ different promoters for the 
crRNA and taRNA elements of a cognate pair. The crRNA elements may be positioned 
upstream of a site for convenient insertion of an open reading frame, e.g., a restriction site or 
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polylinker. The plasmid may further comprise one or more open reading frames positioned 
downstream from a crRNA element, preferably in frame with the start codon. Figure 5 
presents representative examples of plasmids provided by the invention. It is to be 
understood that the gfp sequence may be replaced by any open reading frame of choice or 
such open reading frame can be added to create a larger open reading frame encoding a 
fusion protein. The plasmids may also include sequences encoding tags, e.g, His tag, FLAG 
tag, HA tag, Myc tag, etc., to enable purification of a protein encoded by a reading frame in 
frame with the tag sequences. 

[00138] In addition to providing the crRNA and taRNA elements on plasmids, a DNA 
construct that provides a template for transcription of one or more crRNA and/or taRNA 
elements (and, optionally, an open reading frame downstream of a crRNA element) may be 
integrated into the genome of a cell. In general, such constructs may be integrated at 
random locations. Alternatively, the constructs may be integrated at specific locations, e.g., 
at regions of homology to the construct. For example, if the construct comprises a promoter 
' and/or ORF that is homologous to endogenous cellular DNA, the construct may be inserted 
so as to replace the endogenous DNA. Methods for inserting DNA sequences into the 
genome of prokaryotic cells and for targeting such DNA sequences for insertion at specific 
locations are well known in the art (73, 74). Methods for inserting DNA sequences into the 
genome of eukaryotic cells are also well known in the art. Standard transfection or viral 
infection methods may be used to achieve random integration. Alternately, homologous 
recombination may be used to integrate DNA sequences into the genome of eukaryotic cells 
and/or to generate transgenic non-human mammals in which an endogenous DNA sequence 
is replaced by the DNA construct (75 - 77). 
[00139] C. Ligand-Responsive Aptamer Domains 

[00140] A number of naturally occurring mRNA molecules have been shown to bind to 
small molecules such as thiamine, coenzyme B12, flavin mononucleotide, etc., causing 
allosteric rearrangment of the mRNA, which results in modulation of gene expression. Such 
RNAs exist in both prokaryotic and eukaryotic cells (10, 1 1, 16, 70). The inventors have 
recognized that by incorporating specific ligand-binding domains into the crRNA and 
taRNA elements of the invention, these elements can be made responsive to the presence or 
absence of the ligand. Therefore, in certain embodiments of the invention the crRNA or 
taRNA comprises a domain that responds to an endogenous or exogenous signal. 
Riboregulators that include such a domain are referred to as responsive riboregulators. 
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Signals to which responsive riboregulators respond include, for example, (i) small 
molecules; (ii) metabolites; (iii) nutrients; (iv) metal ions; (v) cell density signals. 
[00141] As discussed above, in vitro selection has been used to isolate nucleic acid 
sequences (aptamers) that bind small molecules with a high degree of affinity and specificity 
(64, 68, and 69 and references therein). Binding of the small molecule ligand can alter the 
structure of the aptamer, and this alteration may be used to control translation. For example, 
insertion of an RNA aptamer that specifically bind to aminoglycosides into the 5' UTR of an 
RNA allowed its translation to be repressible by lignad addition (64). By incorporating a 
ligand-specific aptamer into the taRNA elements of the invention, their ability to activate 
translation can be made responsive to presence of the ligand. 

[00142] In accordance with these embodiments of the invention an RNA aptamer that 
binds to a particular molecule of interest is selected using established in vitro selection 
techniques as described above. The aptamer is incorporated into the taRNA. Binding of the 
ligand induces a conformational change in the taRNA that allows or enhances the interaction 
between the crRNA and the taRNA, thereby activating translation. In these embodiments of 
the invention the taRNA may be present constitutively within a cell but is inactive in the 
absence of the ligand. Ligand-specific aptamers can also be incorporated into the crRNA 
elements of the invention and/or used in conjunction with cis-repressive sequences to allow 
increased control over gene expression. 
[00143] D. Exogenous Delivery 

[00144] The description above referred primarily to applications of the riboregulator 
elements that involved their transcription within cells. However, according to certain 
embodiments of the invention either the crRNA element with a downstream ORF, the 
taRNA element, or both, is synthesized in vitro and delivered to a cell. In most cases the 
crRNA element is transcribed within a cell and the taRNA element is delivered 
exogenously. For such applications it may be desirable to synthesize a riboregulator 
element, e.g., a taRNA element using either DNA or a combination of DNA and RNA. It 
may also be desirable to employ various nucleotide analogs in order, for example, to 
increase the stability and/or nuclease resistance of the molecule. In addition, such 
modifications and analogs may be used to alter the base-pairing properties of the molecule 
as desired. 

[00145] According to certain embodiments of the invention various nucleotide 
modifications and/or analogs are used. Numerous nucleotide analogs and nucleotide 
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modifications are known in the art, and their effect on properties such as hybridization and 
nuclease resistance has been explored. (In general, nucleotide analogs and modified 
nucleotides will be referred to herein as "nucleotide analogs".) For example, various 
modifications to the base, sugar and internucleoside linkage have been introduced into 
oligonucleotides at selected positions, and the resultant effect relative to the unmodified 
oligonucleotide compared. A number of modifications have been shown to alter one or 
more aspects of the oligonucleotide such as its ability to hybridize to a complementary 
nucleic acid, its stability, etc . For example, useful 2-modifications include halo, alkoxy and 
aUyloxy groups. US patent numbers 6,403,779; 6,399,754; 6,225,460; 6,127,533; 6,031,086; 
6,005,087; 5,977,089, and references therein disclose a wide variety of nucleotide analogs 
and modifications that may be of use in the practice of the present invention. See also 
Crooke, S. (ed.) "Antisense Drug Technology: Principles, Strategies, and Applications" (1 st 
ed), Marcel Dekker; ISBN: 0824705661; 1st edition (2001) and references therein. As will 
be appreciated by one of ordinary skill in the art, analogs and modifications may be tested 
using, e.g., the assays described herein or other appropriate assays, in order to select those 
that effectively regulate translation. Additional modifications such as addition of 
polyethylene glycol (PEG), e.g., to increase stability, can be used. 
[00146] A variety of methods can be used to introduce riboregulator elements into cells, 
particularly into eukaryotic cells. Numerous agents that facilitate uptake of oligonucleotides 
and of DNA constructs by cells are known in the art and include various lipids, e.g., cationic 
lipids such as Oligofectamine™, polymers, e.g., cationic polymers, etc. In general, any of 
the reagents used for RNA or DNA delivery in culture or in vivo (e.g., materials for use in 
gene therapy) may be used. 
[00147] E. Additional Cis Elements 

[00148] By adding one or more additional cw-repressive sequences to the cis-repressive 
RNA elements described above, it is possible to obtain more finely grained control over 
expression. In particular, it is possible to obtain an intermediate expression level using two 
different cis elements both of which interact with the same cognate taRNA, or with different 
taRNAs. For example, as shown in Figure 9, an additional sequence (labeled p) can be 
added upstream of a as-repressive sequence (labeled a). The additional sequence is 
complementary or substantially complementary to the c/s-repressive sequence. In Figure 9, 
the as-repressive sequence contains a portion that is complementary or substantially 
complementary to the RBS. This sequence will generally not form a stem with sequences 
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between its 3* end and the beginning of the ORF. Thus in Figures 9 A and 9B, translation is 
not repressed, while in Figure 9C, in the presence of the cis-repressive sequence, translation 
is repressed. When the additional sequence is added, it can form a stem-loop structure with 
the czj-repressive sequence, thereby preventing the cw-repressive sequence from 
sequestering the RBS, as shown in the lower portion of Figure 9D. Thus two alternate 
structures are possible. In one structure (upper portion of Figure 9D), the c/s-repressive 
sequence forms a stem that sequesters the RBS, while in the alternate structure the cis- 
repressive sequence forms a stem with the additional sequence, allowing the ribosome to 
access the RBS. While not wishing to be bound by any theory, it is likely that at any point 
in time individual mRNAs will assume one structure or the other, and a single mRNA may 
switch back and forth between the two structures. Thus it is evident that in a population of 
mRNAs or for any individual mRNA, translation will be possible during a fraction of the 
time. Thus an intermediate level of translation occurs, as shown in Figure 9D. This general 
principle can be extended, e.g., by incorporating additional sequences into the repressive 
RNA elements of the invention, to allow the formation of a range of different stem-loop 
structures, which may exhibit greater or less stability than a stem-loop formed between the 
cw-repressive sequence and sequences between the 3' end of the cis-repressive sequence and 
the 5' end of the ORF (or the 5' portion of the ORF). Alternatively, multiple transacting 
RNAs, each of which can be unique, can be targeted for the corresponding cis elements. 
[00149] X. Additional Features and Applications of the Riboregulators 
[00150] As described herein, the inventors have created post-transcriptional control 
elements that circumvent the need for specific promoters, genes, or regulators, and can be 
utilized as control modules in genetic circuits to investigate additional layers of gene 
regulation. Given their scalability and specificity of interaction, the number of elements and 
their range of functions can be greatly expanded by in vitro selection techniques as 
described above (22-24), creating a large collection of interactive ribregulators. Such an 
assembly could generate in vivo cascades of highly specific riboregulator or riboswitch 
networks, which may respond faster, conserve more energy (43), and be more complex than 
networks based solely on DNA-protein components. Linking these switches with 
endogenous riboregulators and switches (10-16) and cell-cell signaling molecules, would 
further broaden their utility. Such, post-transcriptional control systems will also be a 
valuable tool in resolving the complexity of large-scale gene networks, since current studies 
rely on evaluating global patterns of gene expression or constructing synthetic networks, 
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which have been limited to well-characterized transcription factors. For example, the use of 
riboregulators could selectively perturb networks of unknown structure and reveal functional 
properties of genetic networks. 

[00151] The work described herein, which details positive and negative post- 
transcriptional control, elucidates the action of cis and trans acting regulatory RNAs. While 
not wishing to be bound by any theory, the inventors find that conformational changes in 
RNA structures and stable duplex formation not only depend on the initial recognition 
complex, but also on the ability of trans activators to bind to nucleotides in the partially 
destabilized stem structure. In the system described herein, the specificity of intermolecular 
RNA interaction arises from unique sequences in the crRNA stem and not the consensus 
sequence of the recognition loop. Studies of artificial riboregulators and switches of this 
sort can be a valuable method of characterizing potential modes of action of sRNAs, which 
have been implicated as regulators of transcription, translation, and modulators of 
developmental switches. In addition, this work may further motivate ongoing sequence- and 
structure- based efforts to identify novel sRNAs, particularly trans activators, in both 
prokaryotes and eukaryotes. Ultimately, the versatility of artificial riboregulators and 
switches may also yield additional insights into RNA-based cellular processes and RNA's 
evolutionary role in biology (1,2). 

[00152] The riboregulators of the invention find use in a wide variety of contexts and 
possess features that distinguish them from other available systems for control of gene 
expression. In general, the riboregulators are useful for any of the wide variety of 
applications for which inducible and repressible promoter systems are used. The 
riboregulators may provide a faster response than could be achieved by placing a gene under 
control of an inducible promoter. Unlike regulation that involves activating or repressing 
transcription of a full-length mRNA, the present invention requires transcription (or 
exogenous administration) of a short RNA segment (the taRNA), which then relieves 
translational repression of a pre-existing mRNA. In addition, the riboregulator system does 
not require replacement of the endogenous promoter, thus physiologic levels of transcription 
and transcriptional responses to environmental and developmental stimuli can be 
maintained. This is typically impossible with currently available inducible promoter 
systems. Furthermore, the riboregulators of the invention may be used in conjunction with 
trancriptional control elements (e.g., regulatable promoters), to achieve a greater dynamic 
range (i.e., a greater range of expression levels) than could otherwise be achieved. In 
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addition, the riboregulator system can be used to control expression of a single transcription 
unit within an operon. 

[00153] By providing the same crRNA element upstream of a plurality of different open 
reading frames, these reading frames may be coordinately regulated in response to a single 
stimulus. For example, a single crRNA element may be positioned upstream of a set of 
open reading frames. By providing the cognate taRNA (e.g., by inducing its transcription or 
exogenously), translation of the set of open reading frames will be coordinately activated. 
For example, a particular crRNA sequence may be positioned upstream of a plurality of 
open reading frames coding for proteins that are involved in a single biological process (e.g., 
a developmental process, a response to an environmental stress, etc.). Expression of the 
entire set of proteins may be activated by a single taRNA. Thus the taRNA may act as a 
master control switch. The taRNA may be delivered exogenously, or its transcription may 
be induced. Alternately, a template for transcription of the taRNA may be inserted 
downstream of a plurality of different promoters, e.g., promoters that respond to 
environmental or developmental stimuli, so that these stimuli will cause transcription of the 
taRNA and activation of translation. In yet another variation, a responsive taRNA may be 
used. In this case, presence of the appropriate activating ligand or environmental condition 
activates the taRNA, which then binds to the cognate crRNA present upstream of the open 
reading frames, thereby derepressing translation. 

[00154] The riboregulators may function as switches, e.g., on-off switches or may 
provide a graded response. They may operate within genetic networks (either synthetic or 
natural genetic networks) and/or provide a link between synthetic and natural genetic 
networks. They may be used to introduce perturbations into networks of unknown structure 
in order to reveal natural network connectivity. This allows the identification of key 
components of such networks, which may provide suitable therapeutic targets for treatment 
of diseases and conditions in which such networks malfunction. With the increasingly rapid 
acquisition of genetic information and powerful new experimental techniques the ability to 
construct, analyze, and interpret qualitative and quantitative models is becoming more and 
more important (45). The ability to analyze and perturb natural genetic networks and to 
create such networks using tools such as the riboregulators of the present invention is 
important for the engineering of artificial gene regulatory networks (see 63 for a review of 
the engineering of gene regulatory networks). 
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[00155] A particular use of the riboregulator systems is to determine the effect on global 
gene expression levels or on the expression levels of a particular gene or plurality of genes 
in response to changes in the expression of a gene of interest. For example, expression of a 
gene of interest (i.e., translation of an mRNA transcribed from the gene), can be repressed 
using an appropriate crRNA element, and expression levels of other genes can be measured. 
Translation can then be activated by a cognate taRNA, and expression levels of the gene(s) 
can be measured again. By comparing expression levels before and after activation of 
translation, the effect of the gene of interest on expression levels of other genes can be 
determined. In general, the expression level of such genes can be measured at either the 
mRNA or protein level by a variety of methods including, but not limited to, microarray 
analysis, Northern blot, RT-PCR, Western blot, immunoassay, etc., or by competitive PCR 
coupled with matrix-assisted laser desorption/ionization-time-of-flight (MALDI/TOF) mass 
spectometry as described herein. 

[00156] There is an increasing interest in creating circuits and performing computations 
using biological components (66). The riboregulators of the present invention can operate in 
such circuits as digital switches, analogous to the role played by transistors in electronic 
circuits. The state of translational repression established by the crRNA elements 
corresponds to the LOW state while the activated state established by the taRNA element 
corresponds to the HIGH state. By using a responsive taRNA element, repeated ON/OFF 
switching can be achieved. The ON or OFF state may also be used for information storage. 
[00157] The riboregulators of the invention are useful for control of bioprocesses. A 
large number of useful substances are most efficiently produced by microorganisms such as 
bacteria or fungi. This includes some pharmaceutical products, food additives and 
supplements, bulk chemicals such as ethanol, and enzymes. In addition, an increasing 
number of useful products including a variety of pharmaceutical agents (e.g., antibodies, 
enzymes) are produced by harvesting them from mammalian cells or culture medium. 
Efficient bioprocess operation frequently involves attempts to control the metabolism of the 
cells involved in the process. For example, it may be desirable to maintain the cells in a 
particular physiological state and then rapidly switch them to a different state, e.g., to 
prevent the accumulation of undesired products or to achieve maximum rate of production 
of the desired product. The riboregulators may be used to alter endogenous metabolic 
processes to improve yield or rate of production. For example, a particular crRNA sequence 
may be positioned upstream of a plurality of open reading frames coding for enzymes that 
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are involved in a single biosynthetic pathway. Expression of the entire set of enzymes may 
be activated by a single taRNA The taRNA may be delivered exogenously, or its 
transcription may be induced. Alemately, a responsive taRNA may be used. In this case, 
presence of the appropriate activating ligand or environmental condition activates the 
taRNA, which then binds to the cognate crRNA present upstream of the open reading 
frames, thereby derepressing translation. 

[00158] The riboregulators may be employed in conjunction with gene knockouts. For 
example, a gene can be knocked out in prokaryotic cells or in eukaryotic cells in tissue 
culture or in eukaryotic organisms (e.g., fungi, mice, using standard methods, and more 
recently pigs, sheep, bovines, etc. using methods known in the art). The gene can then be 
reintroduced with its endogenous promoter and a crRNA element upstream of the coding 
sequence. This will re-establish a responsive endogenous promoter-gene pair that is 
repressed. Physiologic transcription levels can be maintained and post-transcriptional 
expression can be modulated using a cognate taRNA, which can be provided exogenously or 
by inducing its transcription. Alternately, a responsive taRNA can be used, in which case 
translation can be activated by providing the appropriate ligand or environmental condition. 
[00159] The riboregulators also find use for the control of plasmid copy number. In 
addition, the riboregulators can be used in conjunction with in vitro translation systems. 



Examples 

[00160] Example 1: Synthesis and activity of cis-repressive RNA elements 
[001 61 ] This example describes the design of a variety of ris-repressive RNA elements 
(crRNAs) and creation of DNA constructs that provide templates for their synthesis. The 
example further presents measurements demonstrating the ability of these RNA elements to 
repress translation of downstream coding sequences. Example 2 describes corresponding 
fra/w-activating nucleic acid elements (taRNAs) and their ability to activate gene expression 
by relieving the translational repression caused by the the crRNAs. 
[00162] Materials and Methods 

[00163] Plasmid constowction, cell strains, reagents: Basic molecular biology techniques 
were implemented as described in cloning manuals (47). Two riboswitch systems were 
constructed, in which each system utilized two separate promoters to drive the expression of 
the cw-repressive RNAs (crRNA) and fmws-activating RNAs. In the first riboswitch, the 
PL(tetO) promoter drives expression of crRNA, and the pBAD promoter drives expression 
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of taRNA In the second system, PL(lacO) drives the expression of crRNA and PL(tetO) 
drives expression of taRNA. For each system, three main sets of plasmids were constructed 
(Figure 5): (i) crRNA plasmids, (ii) taRNA plasmids, and (iii) riboregulator plasmids. All 
plasmids (Table 4) contained the pBR322 ColEl origin of replication and genes coding for 
either ampicillin or kanamycin resistance. Oligonucleotide primers were purchased from 
Amitof Biotech and Integrated DNA Technologies. All genes and promoters were PCR 
amplified using the PTC-200 PCR machine (MJ Research) with PfiiTurbo DNA Polymerase 
(Stratagene). DNA sequences were obtained as follows: gfpmut36 gene from pJBAl 13 (48), 
PL(lacO) promoter from pZE12-luc (36), PL(tetO) promoter and ribosome binding site 
(RBS) sequence from pZE21 (36), and the arabinose operon (pBAD) from pBADHisA 
(Invitrogen). cis and trans sequences were introduced through oligonucleotide design. 
[00164] Two PCR reactions were performed to construct the stem-loop cis sequences on 
the crRNA plasmids (Table 5). In the first PCR reaction, a forward primer for PL(tetO) [or 
PL(lacO)] was used with a reverse primer for PL(tetO) [PL(lacO)], which contains the cis- 
repressive sequence and a 5'-labelled phosphate end. In the second PCR reaction, a forward 
primer for the RBS site, containing the cr loop sequence, was used with a gfp reverse 
primer. The PCR products were annealed together via blunt-end ligation and cloned into the 
pZE21G (Table 4) vector using unique restriction enzyme sites. The taRNA sequences 
(Table 6) were constructed by annealing two single-stranded, reverse complementary 
oligonucleotides in a DNA hybridization reaction. The double-stranded products 
(approximately 80-100 bp), containing restriction sites, were subsequently cloned into an 
ampicillin-resistant plasmid downstream of the pBAD [or PL(tetO)] promoter. 
[00165] All plasmids were constructed using restriction endonucleases and T4 DNA 
Ligase from New England Biolabs. Plasmids were introduced into the E. coli XL- 10 strain 
(Stratagene; Tef A(mcrA)183 A (mcrCB-hsdSMR-mrr)173 endAl supE44 thi-1 recAl 
gyrA96 relAl lac Hte [F proAB lacl*ZDM15 TnlO (TeO Amy CamO using standard heat- 
shock, TSS, transformation protocols (47). The E. coli XL-10 strain, DH5cc-pro strain 
(Clontech), 2.300 strain (Genetic Stock Center no. 5002, lacl22, rpsL135, and thi-1), and 
wildtype K-12 strain were used for all experiments. All cells were grown in selective 
media: LB (DIFCO) and either 30 |ag/ml kanamycin or 100 jag/ml ampicillin (Sigma). 
Plasmid isolation was performed using PerfectPrep Plasmid Isolation Kits (Eppendorf). 
Subcloning was confirmed by restriction analysis. Plasmid modifications were verified by 
sequencing using the PE Biosystem ABI Prism 377 sequencer. 
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[00166] Gene expression analysis 

[00167] For all experiments, cells were grown overnight in the appropriate conditions, 
diluted 1 : 1000, and re-grown prior to collecting RNA samples and measuring GFP 
expression by flow cytometry. All RNA and GFP measurements were determined during 
logarithmic growth at OD 6 oo 0.4-0.6, measured by a SPECTRAFluor Plus (Tecan). A 
positive control, pZE21G, was constructed such that the promoter drives the expression of 
gfpmut3b without the repressive cis element Cis experiments were conducted under two 
conditions: no anhydrotetracycline (aTc) and 30 ng/ml aTc. An insufficient concentration of 
TetR protein was present in XL- 10 cells to saturate the tetO operator sites. Therefore, in 
control experiments (Fig. 2B), we observed intermediate levels of GFP expression, which 
corresponds to intermediate transcription rates. DH5oc-pro cells contained higher cellular 
levels of TetR, and thus demonstrated a lower expression state at no aTc inductioa 
Cis/trans experiments were conducted under four conditions: (i) no aTc, no arabinose, (ii) 
no aTc, 0.25% arabinose, (iii) 30 ng/ml aTc, no arabinose, and (iv) 30 ng/ml aTc, 0.25% 
arabinose. In these experiments, aTc controls the transcription of crRNA and arabinose 
controls the expression of taRNA. We measured the expression of the riboregulator systems 
in two additional strains, lacking TetR protein production: 2.300 strain and wildtype K-12 
strain. In these strains, we grew cultures containing riboregulator systems in the absence 
and presence of arabinose and obtained results consistent with those obtained using the other 
strains (XL10 and DH5ctpro). 
[001 68] GFP quantitation by flow cytometry 

[00169] All expression data were collected using a Becton Dickinson FACSCalibur flow 
cytometer with a 488 nm argon laser and a 515-545 nm emission filter (FL1) at low flow 
rate. Before analysis, cells were pelleted and resuspended in filtered PBS (Life 
Technologies, pH=7.2) immediately following each time point Calibrite Beads (Becton 
Dickinson) were used to calibrate the flow cytometer. Each fluorescent measurement of 
gene expression was obtained from populations of > 100,000 cells. Flow data were 
converted to ASCII format using MFI software (E. Martz, University of Massachusetts, 
Amherst). Matlab (Mathworks, Inc., Massachusetts) software was used to filter (in a narrow 
forward scatter range) and analyze a homogenous population of cells in each sample. 
[00170] Quantification of cellular RNA concentrationsircPCR Gene Expression Analysis 
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[00171] Real competitive PCR (rcPCR) was carried out in essentially the same way as 
previously reported (38). The assay designs for 16S rRNA, taRNA and crRNA are 
described in Table 7. The steps of rcPCR are described briefly below. 
[00172] Step 1 : Reverse transcription Total RNA samples were obtained from cultures 
in logarithmic growth Cultures were immediately placed in RNAprotect (Qiagen), and 
RNA was isolated using RNeasy Mini Kit (Qiagen). RNA samples were subjected to a 
DNase I (DNA-free, Ambion) digestion and diluted 10 times before reverse transcription. 
Each reverse transcription reaction contains 1 mL diluted RNA, 1 mL ImProm-II 5 buffer, 1 
mL MgCl 2 (25 mM), 0.3 mL dNTP mix (10 mM each), 0.3 mL ImProm-II reverse 
transciptase (Promega), 0.5 mL random primer (0.5 mg/mL) and 0.9 mL RNAse free water. 
Only RNA was added first and heated at 70°C for 5 min and put on ice immediately. The 
remaining reagents were added and reverse transcription was carried out by incubating at 
25°C for 5 min, followed by 42°C for 1 hour and finally 70°C for 15 min to inactivate the 
reverse transcriptase. All temperature controlled reactions (reverse transcription, PCR 
amplification and base extension) were carried out in a GeneAmp 9700 thermocycler (ABI). 
[00173] Step 2: PCR amplification. Reverse transcription products were diluted 10 times 
before PCR Each PCR reaction contains 1 mL diluted cDNA, 0.5 mL 10 HotStar Taq PCR 
buffer, 0.2 mL MgCl 2 (25 mM), 0.04 mL dNTP mix (25 mM each), 0.02 mL HotStar Taq 
Polymerase (50 U/mL, Qiagen), 0.01 mL competitor DNA, 1 mL forward and reverse 
primer (1 mM each) and 2.23 mL ddH 2 0. The PCR condition was: 95°C for 15 min for hot 
start, followed by denaturing at 94°C for 20 sec, annealing at 56°C for 30 sec and extension 
at 72°C for 1 min for 45 cycles, with final incubation at 72°C for 3 min. 
[00174] Step 3: Base extension PCR products were treated with shrimp alkaline 
phosphatase (SEQUENOM) for 20 min at 37°C first to remove excess dNTPs. A mixture of 
0.17 mL hME buffer (SEQUENOM), 0.3 mL shrimp alkaline phosphatase (SEQUENOM) 
and 1.53 mL ddH 2 0 was added to each PCR reactioa The reaction solutions (now 7 mL 
each) were incubated at 37°C for 20 min to remove excess dNTPs, followed by 85°C for 5 
min to inactive the phosphatase. For each base extension reaction, 0.2 mL of selected 
ddNTP/dNTP mix (SEQUENOM), 0.108 mL of selected extension primer, 0.018 mL of 
ThermoS equenase (32 U/mL, SEQUENOM) and 1.674 mL ddH 2 0 were added The base 
extension condition was as follows: 94°C for 2 min, followed by 40 cycles of 94°C for 5 sec, 
52°C for 5 sec and 72°C for 5 sec. 
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[00175] Step 4: Liquid dispensing and MALDI-TOF MS. The final base extension 
products were treated with SpectroCLEAN (SEQUENOM) resin to remove salts in the 
reaction buffer. This step was carried out with a Multimek (Beckman) 96 channel auto- 
pipette. Sixteen jaL resin/water solution was added into each base extension reaction, 
making the total volume 25 jiL. After centrifugation (2,500 rpm, 3 min) in a Sorvall Legend 
RT centrifuge, approximately 10 nL of reaction solution was dispensed onto a 384 format 
SpectroCHIP (SEQUENOM) pre-spotted with a matrix of 3-hydroxypicolinic acid (3-HPA) 
by using a MassARRAY nanodispenser (SEQUENOM). A modified Bruker Biflex 
MALDI-TOF mass spectrometer was used for data acquisitions from the SpectroCHIP. 
Mass spectrometric data were automatically imported into the SpectroTYPER 
(SEQUENOM) database for automatic analysis such as noise normalization and peak area 
analysis. The allelic frequency of 16SrRNa, crRNa, and taRNA were exported to Excel 
(Microsoft Office) and analyzed The reported concentrations of crRNA and taRNA in 
Table 1 are expressed as a percentage of 16SrRNA concentration within each sample. 
[00176] Table 5 lists cw-repressive RNA sequences in the crRNA constructs, loop 
containing the YUNR (TTGG) recognition motif, and ribosome binding site (RBS) used 
herein. 

[00177] Results 

[00178] Several features of endogenous riboregulators (17, 33, 34) were used to guide the 
construction of this artificial post-transcriptional regulatory system. With regard to the 
crRNA component, three main features were prominent in the design. First, the DNA 
template for the crRNA is designed according to the following considerations. The cis 
repressive sequence, which consists of a 19 base pair (bp) reverse complementary sequence 
to the RBS, is strategically placed directly downstream of (i.e., in the 3' direction from) the 
promoter and upstream of (i.e., in the 5' direction from) the RBS sequence, so that in the 
mRNA transcript the cw-repressive sequence is located in the 5' UTR. Importantly, the 
introduced cis sequence does not alter the coding frame of the targeted gene and does not 
affect native transcription Second, a short nucleotide sequence, placed between the cis- 
repressive sequence and the RBS, permits formation of a hairpin stem-loop structure in 
which the ds-repressive RNA and the RBS form the stem, and the short intervening 
nucleotide sequence forms the loop. Third, a YUNR (pYrimidine-Uracil-Nucleotide- 
puRine) consensus sequence, which has been shown to be an important target for 
intermolecular RNA complexes in the native RI system (34), is included in the loop region 
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in the constructs described here, and it is generally preferred that the crRNA includes this 
sequence in the loop region While not wishing to be bound by any theory, it is believed 
that this motif directs taRNA-crRNA binding through a linear-loop intermolecular 
interaction, as shown schematically in Fig. 3C. The taRNA stem contains the nucleotide 
sequence that is complementary to the cw-repressive sequence. In preferred taRNA 
elements this sequence, which possesses high sequence similarity to the RBS, is sequestered 
in the taRNA stem structure to prevent aberrant titration of ribosomes and eliminate any 
possible pleiotropic effect Because the intermolecular RNA interactions rely on specific 
RNA structures, we utilized the Mfold web server (35) using default parameters to generate 
predicted RNA secondary structures. These predicted RNA secondary structures guided the 
sequence and assembly of all RNA sequences. In particular, sequences that yielded more 
than one predicted secondary structure were eliminated. 

[00179] To assess in vivo repressive ability of the 5'-UTR cis element, four crRNA 
variants (crRL, crR7, crRlO, and crRB) were constructed on episomal plasmids that 
propagate in Escherichia coli (E. coli) cells. Four crRNA variants (Fig. 2A), with varying 
degrees of stem sequence complementarity to the RBS, were constructed for two main 
reasons: First, to determine the extent of sequence complementarity required for sufficient 
post-transcriptional repression and second, to investigate if changes in stem sequences, 
which introduce partial complementarity and result in alternate RNA secondary structures 
[i.e., RNA duplex (crRL), inner loops (crR7, crRB) and bulges (crRlO)], destabilize the stem 
loop to help generate an open complex when targeted for activation by rraHS-activating RNA 
(taRNA) (see Example 2). 

[00180] We chose the constitutive PL(tetO) promoter (36), a modified version of the 
native Phage X PL promoter containing two TetR operator sites, to drive the expression of 
each crRNA transcript in which transcription can be modulated by the TetR protein and its 
chemical inducer anhydrotetracycline (aTc). A 25 nucleotide (nt) DNA sequence was 
cloned 27 nt downstream of the the PL(tetO) promoter, such that this cw-repressive 
sequence is present on the 5' UTR of the mRNA (crRNA). The cis sequence included two 
sections: a 19 nt stem sequence, complementary or substantially complementary to the 
RBS, and a 6 nt loop region. A synthetic ribosome binding site from the pZ plasmid system 
(36) and the gfpmut3b gene (37) were cloned directly downstream of the cis sequence. 
Single-cell fluorescence measurements of the Green Fluorescent Protein (GFP) were used to 
monitor the expression state of this post-transcriptional system by flow cytometry. A 
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control plasmid that lacks the cis element and contains an arbitrary sequence upstream of the 
RBS was also constructed (Fig. 2). 

[00181] Flow-cytometric measurements from single cells containing control plasmids and 
constitutive expression of TetR protein show an elevated GFP state at intermediate (no aTc) 
and high (30 ng/ml aTc) transcription rates (Fig. 2B,C). Cells possessing plasmids with the 
upstream cis-repressive elements (crRNAs) were grown under the same conditions. 
Moderate GFP leakage was detected in cultures containing the crRB variant, which contains 
reduced cis sequence complementarity to the RBS. Due to the elevated levels of expression, 
together with a variable secondary structure predicted to lose its recognition site in the stem- 
loop (Fig. 2A), the crRB variant was not used in subsequent investigation. At intermediate 
transcription rates (Fig. 2B), crRL cultures show repressed levels of GFP expression 
indistinguishable from autofluorescence cellular measurements (determined by measuring 
fluorescence of cells containing plasmids that lack GFP). The crR7 and crRlO cultures, 
which demonstrate slightly elevated levels of GFP expression, also show dramatic silencing 
of gene expression At high transcription (Fig. 2C), we also observe low GFP expression 
values for all variants indicating that the cw-repressive 5 f -UTR element renders striking 
suppression of post-transcriptional expression Our results also indicate that the degree of 
repression is not entirely correlated with predicted ^Gm/oM values (Table 1) and base-pairing 
in the stem region [crR7 vs. crRlO, crR12 (see below)]. Thus, placement of the mismatches 
and the resulting structures (i.e., inner loops proximate to the stem-loop and bulges) impact 
the stability of the hairpin stem-loop and the degree of repression. Furthermore, the 
observed repression of > 96% (intermediate transcription) and > 97% (high transcription) 
(98% when background autofluorescence is subtracted) provides improved silencing when 
compared to alternative antisense and trans ribozyme systems that target specific open 
reading frames or RBS sequences (31, 32). 

[00182] In order to confirm that the observed silencing is due to the presence of 
translational repression by the cis sequence, we measured cellular mRNA concentrations. 
Total cell RNA was isolated from cultures containing each crRNA variant and the control 
plasmid, permitting quantitative measurements of mRNA levels by competitive PCR 
coupled with matrix-assisted laser desorption/ionization-time-of-flight (MALDI-TOF) mass 
spectrometry (38). Table 1 lists the measured mRNA concentrations, which are normalized 
by endogenous levels of 16S rRNA in each sample. We consistently observe a four-fold 
increase in mRNA concentration upon shifting from intermediate to high transcription rates 
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(+aTc/-aTc). The RNA concentration results also demonstrate that the crRNA variants are 
present at 40% of the mRNA levels measured from the control cultures. Possible causes of 
RNA loss include premature transcription termination downstream of the haiipin stem-loop 
structure or targeted degradation by RNases that cleave double-stranded RNAs (40, 41). 
Despite the moderate loss of cellular mRNA concentrations, crRNA levels at high 
transcription (+ aTc) are greater than intermediate (-aTc) mRNA control levels indicating 
that sufficient levels of mRNA are available for ribosomal recognition and can serve as 
templates for protein synthesis. Together with the GFP data, these results demonstrate that 
the hairpin stem-loop, which preferentially forms due to the placement of the upstream cis 
sequence, prevents ribosome binding at the RBS and interferes with post-transcriptional 
gene expression. 
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[00183] Table 4: List of Plasmids 



Parent Plasmi«i(s) 



Plasmid 



Description 



pZE21G 
pZE21C 
pZE2IG 
pZE2IG 
P ZE21G 
pZE2IG 

P ZEl2-luc(J).pZE2lal2G 

pZEl24uc (J). pBADHis-A 1 

pZEIS 

P 2C15 

pZEI5 

pZEIS 

pZEIS 

P ZEI5YI2 

PZE15 

pZElSYI2.pZE2IG 

p ZElSYLpZE2lrtLG 

P ZEl5Y7.pZE2loLG 

pZEl5YIO,pZE2l«LG 

pZEl5YI2.pZE2loLG 

P ZEI5YUpZE2lo7G 

pZEI5Y7, pZE2lr*7G 

pZEIJYIO. pZ£2W»7tt 

P ZEI5YLl2.pZE2W*7G 

pZEUYL. pZE2la!0G 

pZEI5Y7. pZE2lnlOG 

P ZEI5YI0. pZE2lr»IUG 

pZEI5YI2,pZE2lolt>G 

pZEISYU pZE2IaI2G 

pZEI5Y7,pZE2lr>!2G 

pZElSYIO. pZE2lr*l2G 

pZEi5YI2,pZE2lr>!2Ci 

pZEl5YB.pZE2l#>BG 

pZEISYUst. pZE21r»!2G 

pZEI5Y22, pZE2lr>22G 

pZER2IYI2*>l2G 

P ZER21YI2ol2G 

pZER2IYI2ol2G 

pZER2IYI2o!2G 

pZER2IYI2ol2G 

pZCR2IYI2«l2G 

pZEIIYI2.pZE22oi2G 

pZEMYJ2.pZE22wl2G 

pZEIIYI2.pZE22al2G 

P ZEllYI2.pZE22ol2G 

pZEHY»2.pZE22ot2G 

P ZEHYt2. P ZE22ol2G 



piEjnr 

pZE2loLG 
pZE2laBG 
pZE2la7G 
P ZE2lalOG 
P ZE2lal2G 
pZE2ta22G 
pZE22al2G 
pZEI5 
pZEISYL 
pZEI5YB 
pZEI5YL7 
P ZE15YLI0 
pZE15YL12 
pZE15YI2st 
pZEI5Y22 
P ZE1IYt2 
pZER2IYLaLG 
pZER2l Y7qLG 
P ZER2lYI0aLG 
P ZER2lYI2aLG 
P ZER2IYLa7G 
pZER2IY7a7G 
pZER2IYI0a7G 
P ZER2IYl2a7G 
pZER2lYLo!0G 
pZER2lY7a!OG 
pZER2IYI0ol0G 
pZER2lYI2alOG 
pZER21YLn>l2G 
pZER21Y7al2G 
pZER2IYI0al2G 
P ZER2IYI2al2G 
pZER2l YBaBG 
pZER2lYl2sta!2G 
pZER21Y22o22G 
pZER2lYl2Aal2G 
pZER2IY)2Bal2G 
pZER2IY12Cal2G 
P ZER21UIYl2ol2G 
P ZER21U2YI2ol2G 
. P ZER2IU3YI2al2G 
pZER22YI2-la!2G 
pZER22YI20al2G 
pZER22YI2*5al2G 
P ZER22YI2-l9al2G 
P ZER22YI2-21al2G 
pZER22Y»2-23ol2C 



ColEl-vcctor. kanamycin resistance, PL(tctO) producing gfpmuiJb 
crRL sequence inserted downstream of PL(tetO) 
crRB sequence inserted downstream of PUtctO) 
crR7 sequence inserted downstream of PL(tctO) 
crR 10 sequence inserted downstream of PUtctO) 
crRI2 sequence inserted downstream of PUtctO) 
crR22 (short cr element) 

crR 1 2 sequence inserted downstream of PLflacO) 

Col El -vector, ampicillin resistance, pBAD promoter 

pDAD producing taRL 

pBAD producing laRB 

pBAD producing taR7 

pBAD producing taR 10 

pBAD producing taR 1 2 

taRt 2 with 5' itabiliaer element (7, ff) 

pBAD producing taR22 

PL(tetO) producing URI2 

taRL-crRL Riboswitch 

taR7-crRL Riboswitch 

taRlO-crRL Riboswitch 

taR] 2-crRL Riboswitch 

taRL-crR7 Riboswitch 

taR7-crR7 Riboswitch 

taRll>-crR7 Riboswitch 

taRI2-crR7 Riboswitch 

taRL-crRIO Riboswitch 

taR7-crR10 Riboswitch 

taR 1 0-crR 10 Riboswitch 

taRI2-crRl0 Riboswitch 

taRL-crRU Riboswitch 

taR7-crRI2 Riboswitch 

taRIO-crRl2 Riboswitch 

taR12-crRI2 Riboswitch 

URB-crRB Riboswitch 

taRl2-crR12 Riboswitch + S 1 stabilizer element 
taR22-crR22 Riboswitch 
taR12A-crR12 Riboswitch, 96% taR-crR duplex 
taR12B-crR12 Riboswitch, 96% taR-crR duplex 
taR i 2C-crR 1 2 Riboswitch, 1 00% taR-crR duplex 
taRI2-crRJ2: 3* stem or taRI2 binds to 5' UTR of crR!2 
taRI2-crRI2: 3' stem oftaRI2 binds to 5* UTR of crR 12 
taR12~crRI2: 3* stem of taR 1 2 binds to 5' UTR of crR 12 
taRI2-crRl2 Riboswitch II; +1 transcription of laR 1 2 
taRI2-crRl2 Riboswitch II; +3 transcription of taR 12 
taR12-crRl2 Riboswitch II; +5 transcription of taR 1 2 
taR !2-crRI2 Riboswitch II; +19 transcription of taR 12 
taRI2-crR12 Riboswitch II; +21 transcription of taR 1 2 
taRl2-crR12 Riboswitch II; +23 transcription of taR12 
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[00184] Table 5: Sequences of Qs-repressive RNA Sequences, Loop, RBS, and crRNA 
Constructs. All sequences are shown 5 y to 3' and represented in the form of their 
corresponding DNA sequences as used in the cloning steps. 



L//a-ivcpressivc oequcnuc 


Qpnupnrp IT) "NO* 


C GGACGCACTGACCGAATTC 


SEQIDNO:3 


crRL CTACCTTT C I 1 CT1TAAT 


bbQ JJJ 1NU: 4 


crRB TTCTCTAGTCCTCCTTAT 


SEQ ID NO: 5 


crR7 CTACC1 1 ICTCCTCTAGGA 


SEQ ID NO: 0 


crRlO CTACCTATCTGCTCTTGAA 


SEQIDNO:7 


crR12 CTACCATTCACCTCTTGGA 


SEQ ID NO: 8 


crR22 CTACCATTCACCTGGA 


SEQ ID NO: 9 


Loop TTTGGGT 


SEQ ID NO: 10 


RBS ATTAAAGAGGAGAAA 


SEQ ID NO: 11 






Sequence of Czs-Repressive RNA Constructs 




C GGACGCACTGACCGAATTCATTAAAGAGGAGAAA 
GGTACCATG 


SEQ ID NO: 12 


crRL CTACCT1TCTCCTCTTTAATTTTGGGTATTAAAGAG 
GAGAAAGGTACCATG 


SEQ ID NO: 13 


crRB CTCTAGTCCTCCTTATTTTGGGTATTAAAGAGGAG 
AAAGGTACCATG 


SEQ ID NO: 14 


crR7 CTACCTTTCTCCTCTACK3ATTTGGGTATTAAAGAG 
GAGAAAGGTACCATG 


SEQ ID NO: 15 


crRlO CTACCTATCrGCTCTTGAATTTGGGTATTAAAGAG 
GAGAAAGGTACCATG 


SEQ ID NO: 16 


crR12 CTACCATTCACCTCTTGGATTTGGGTATTAAAGAG 
GAGAAAGGTACCATG 


SEQ ID NO: 17 


cr22 CTACCATTCACCTCTTGGATTTGGGTATTAAAGAG 
GAGAAAGGTACCATG 


SEQ ID NO: 18 



[001 85] Example 2: Synthesis and activity of trans-activating RNA elements 

[00186] This example describes the creation of DNA constructs that provide templates for 

synthesis of a variety of different fraws-activating RNA elements that operate in conjunction 

with corresponding czs-repressive RNA elements described in Example 1. The example 

further presents measurements demonstrating the ability of these RNA elements to activate 

translation of coding sequences whose translation was previously repressed by the 

corresponding cw-repressive RNA. 

[00187] Materials and Methods 

[00188] See example 1. 

[00189] Results 

[00190] Small, fraw-activating RNAs (taRNAs), designed to cause the crRNAs described 
in Example 1 to undergo structural transformation to expose the RBS and initiate translation, 
were produced. The taRNA sequences were selected so as to direct loop (crRNA) - linear 
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(taRNA) RNA pairing. The mode of RNA-RNA interaction was designed based on several 
characterized natural RNA systems (17, 34), e.g., the hok/sok postsegregational killing 
system of plasmid RI (34). While not wishing to be bound by any theory, our artificial 
riboregulator system undergoes the following proposed mechanism: i) the 5'-linear region of 
the taRNA recognizes a YUNR consensus sequence (UUGG) (34) on the loop of crRNA, ii) 
pairing between complementary nucleotides occurs in the presence of an unstable loop-tail 
complex, and iii) an intermolecular RNA duplex structure forms (Fig. 3C). The resulting 
RNA duplex induces a structural change, which permits ribosomal recognition of the 
previously obstructed RBS followed by translation. Because the final RNA complex would 
otherwise include 26 consecutive base pairs, two bulges were intentionally introduced into 
its structure to provide immunity from RNase HI cleavage of RNA duplexes (40, 41). 
[00191] In order to assess the activation ability of each crRNA variant, unique taRNA 
structures were designed for each crRNA target ensuring that the final duplex structures all 
contain 24 base pair matches and two dispersed bulges. Table 6 presents sequences of the 
taRNA molecules that were generated. These taRNA molecules were produced in vivo from 
the arabinose operon (pB AD), such that their transcription rates could be modulated by the 
presence of arabinose sugar and AraC protein (endogenously present in the cell). Initially, 
three taRNA-crRNA cognate pairs (taRL-crRL, taR7-taR7, and taRlO-taRlO) were 
investigated to measure the resulting activation of GFP expression in the presence of the 
small trans-activating RNAs. Cultures containing the crRL and crR7 variants show no 
detectable increase in GFP expression at high arabinose induction of taRL and taR7, 
respectively. However, upon induction of taRlO, cultures containing crRlO exhibit 5x 
increase in GFP expression (Fig. 3D,F). 

[00192] Based on the results obtained with the initial set of taRNA-crRNA pairs, we 
constructed another taRNA-crRNA pair: taR12 and crR12 (Fig. 3A, B). The crR12 variant, 
similar in structure to crRlO, also contains three dispersed bulges in its predicted secondary 
structure. In the absence of arabinose, cells containing crR12 show a low, near 
autofluorescence repressed state, and upon arabinose induction, we observe a lOx increase 
in GFP expression (Fig. 3E, F). These results suggest that partial helix destabilization (e.g., 
presence of bulges) in crRNA seems to be important for the taRNA to mediate conversion 
between a closed and open complex to form an intermolecular RNA duplex, which enables 
protein translation. As compared to the endogenous DsrA-RpoS system, which exhibits 3x 
activation (17), our results demonstrate significantly improved level of activation. 
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[00193] In Fig. 3F, we present the dose-response curves in which each point represents 
the averaged response of a population to a particular level of induction with arabinose. 
Average response was obtained by measuring mean GFP fluorescence from a uniform 
population of cells using flow cytometry. Qualitatively similar dose-response curves were 
obtained for both riboregulator pairs: We observed no activation at low (< lO^/o) arabinose 
concentrations, followed by a rise in activation at intermediate (10^-10" 2 %) arabinose 
concentrations, and finally a high state which plateaus at elevated levels (> 10" 2 %) of 
arabinose. These data show tunable activation of post-transcriptional expression through the 
controlled introduction of rra/w-activating RNA Interestingly, the taR12:crR12 pair 
demonstrates a larger dynamic range than the taR10:crR10 riboswitch. One possible 
explanation for this result is the following: Scrutiny of the flow cytometric histograms 
depict greater cis repression and higher trans activation for the crR12 variant (Fig. 3E) than 
for the crRlO variant (Fig. 3D) resulting in greater separation between low and high states. 
More specifically, these observations may result from the stability of RNA secondary 
structures and the efficiency of intramolecular (crRNA) vs. intermolecular (taRNAcrRNA) 
RNA interactions, all of which may be important in the ultimate phenotypic response of this 
post-transcriptional system. 
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[00194] Table 6: Sequences of 7>a/w-activating RNA Constructs. 5'-st represents the 5* 
stabilizer element inserted in front of taR12. All sequences are shown 5' to 3' and 



represented in the form of their corresponding DNA sequences as used in the cloning steps. 



Construct/Sequence 


Sequence ID NO 


taRL 


ACACCCAAATTAAAGAGGAGAAAGGTAGTGGTGGTTAATGAAA- 

ATT* A A PTTA tT A OHT A OOTTTTV *I"P A A 
All AAL 1 1 AL, 1 AL 1 ALL 1 1 1 1 L 1 1 ALjA 


SEQIDNO: 19 


taRB 


ACGCCCAATAAGG A GG ATA GAGTGGTGGTTAATGAAAATTAAC- 

TT* A r A nT A nTTTT A ^ A 
1 IALIAL1 lADrl I 1 1AOA 


SEQ ID NO: 20 


taR7 


ACACCCAaATCCTAGGGAGAATGGTAGTGGTGGTTaATGAAAA- 

TTA A PTT A PT A PT A PTTTTTP ATAP.A 
1 1 /Vrt.\^ 1 1 rW* I AO 1 At 11111 LA 1 AVJA 


SEQIDNO: 21 


taRlO 


ACACCCAAATTATGAGCAGATTGGTAGTGGTGGTTaATGAAaA- 
TT A A PTT A PT A PT A PTTTPTT A Ci A 


SEQ ID NO: 22 


taR12 


ACCCAaATCCAGGAGGTGATTGGTAGTGGTGGTTAATGAAAAT- 

taacttactactaccatatatctctaga 


SEQ ID NO: 23 


taRl2A 


ACCCaAATCCAGGAGGTGAATGGTAGTGGTGGTTaATGaaaAT- 

taacttactactaccatatatctctaga 


SEQIDNO: 24 


taR12B 


acccaaatccaagaggtgattggtagtggtggttaatgaaaat- 
taacttactactaccatatatctctaga 


SEQ ID NO: 25 


taR12C 


acccaaatccaaagaggtgaatggtaagtgggtggttaatgaa- 

AATTAACITACTACTACCATATATTCTCTaAGA 


SEQ ID NO: 26 


taRU112 


ACCCAAATCCAGGAGGTGATTGGTAGTGGTGGTTaATGAAaAT- 
TaACTTACTAAAATCGGACATCTCTAGA 


SEQ ID NO: 27 


faRU212 


ACCCaaATCCAGGAGGTGATTGGTAGTGGTGGTTaATGAAAAT- 
TaACITTACTACTTACGCGTCATATCTCTAGA 


SEQ ID NO: 28 


taRU312 


ACCCAAATCCAGGAGGTGATTGGTAGTGGTGGTTaATGAAAAT- 
TaACTTACTACGATCAGTGATCTCTAGA 


SEQ ID NO: 29 


taR22 


ACCCaAATCCAGGTGTATGGTAGTGGTGGTTAATGAAAATTAAC 
TTACTACCATTCACCTCGATCTAGA 


SEQ ID NO: 30 


5'-st 


GGGUCCGCUAUGAGGUAaAGUGUCAUAGCGGGCCC 


SEQ ID NO: 31 



5 



[00195] Example 3: Specificity of cis-repressive and trans-activating RNA pairs 
[00196] Materials and Methods 

[00197] Equilibrium constant measurements: The equilibrium constants for complexes 
between the cw-repressive and /ra/w-activating RNAs can be measured in several different 

10 ways. Classic methods include electrophoretic mobility shift assays in polyacrylamide gels 
containing divalent cations (49). Here, we use an approach based on the property of reverse 
transcriptase, which stalls and terminates on stable RNA duplexes. When hybridized to 
crRNA, taRNA creates an obstacle for the reverse transcriptase, yielding a truncated 
product The amount of truncated transcripts versus full length transcripts is assayed by 

1 5 polyacrylamide gel electrophoresis. From these data one can calculate equilibrium 

association and dissociation constants. This method is advantageous over classic methods 
because it uses fluorescence rather than radioactive probes and does not involve RNA cross- 
linking agents. 
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[00198] Step 1 : In vitro RNA transcriptioa The RNA samples were synthesized using 
the MAXIscript T7 In Vitro Transcription Kit (Arabion). Prior to transcription, the genes of 
interest were PCR amplified from the respective plasmids. Forward primers contained the 
T7 promoter sequence at the 5 f overhang end The reverse primers were selected to obtain 
the desired length of the in vitro transcript (Table 7). Each of the in vitro transcription 
reactions contained 300-500 ng of PCR product, yielding approximately 3 jig of RNA 
(Ambion protocols). The template DNA was removed by DNAse I treatment (Ambion 
DNA-free). Products of in vitro transcription were purified by phenol extraction followed 
by ethanol precipitation (47). After removal of unincorporated ribonucleotides, the 
transcripts were transferred to Microtest™ 96-well UV-Vis transparent clear plates (BD 
Falcon) and quantified by UV absorbance (260nm) using a SPECTRAFluor Plus (Tecan). 
[00199] Step 2: Complex formation. For each of the riboregulator pairs, six samples with 
different molar ratios of taRNA-crRNA were prepared The concentrations of taRNA in the 
six samples were: 1.0 /pM 9 0.50 ^iM, 0.25 jiM, 0.13 ^M, 0.06 foM, and 0.03 |aM. The 
concentrations of crRNA were 0.20 |iM and 0.01 for cognate (e.g., taR12-crR12) and 
non-cognate (e.g., taR10-crR12) pairs, respectively. Each of the samples contained 10 |iM 
Tris (pH=7), 10 nM MgCl 2 , 1 pM KCI, 1U of RNAse inhibitor (Applied BioSystems), and 
0.4 pM of Cy5-labeled reverse transcription primer (5'-Cy5-CTTCACCCTCTCCACTGAC- 
3') (SEQ ID NO:32). The reverse transcription primer was designed to anneal the crRNA 
approximately 80 nucleotides downstream of the gfpmut3b start codon and contained the 
Cy-5 label at the 5' end. The samples were given 20 minutes to equilibrate at 37°C. 
[00200] Step 3: Reverse transcription. Reverse transcription was carried out using the 
TaqMan Reverse Transcription Kit (Applied BioSystems). For each reverse transcription 
reaction, 5 ^iL of the complex obtained in the previous step and 2.5 \xL of the RT reagents 
were combined Each reaction contained 5.5 mM MgCl 2 . The reaction conditions were as 
follows: 15 minutes at 37°C, followed by addition of 5 \\L of stop solution 
(formamide:EDTA:bromphenol blue). Reaction products were eluted in denaturing 6% 
polyacrylamide gel (6M urea) and analyzed using ALF sequencing system (Amersham 
Biosciences). The dideoxy sequencing reaction of the crR7 clone was used as a reference 
DNA ladder. 

[00201] Table 7 presents details of the real-time competitive PCR assay design including 
a list of primers used to amplify RT-PCR products obtained from RNA cell preparations. A 
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terminator mix using three different ddNTPs and one dNTP. For example, CGT mix for 16S 
rRNA is ddCTP/ddGTP/ddTTP/dATP. Table 8 presents a list of primers used for in vitro 
PCR amplification. 
[00202] Results 

[00203] To determine if the artificial riboswitch pairs demonstrate high specificity, we 
investigated all 16 combinations (L, 7, 10, and 12) of the taRNA-crRNA constructs. In Fig. 
3G, we present results from separate cultures containing crR12 and four different taRNAs 
(taRL, 7,10,12). At no arabinose induction, we measure effective derepression at near 
autofluorescence levels of GFP expression and nearly undetectable concentrations of 
taRNAs, obtained by competitive PCR coupled with MALDI-TOF mass spectrometry (38). 
Upon arabinose induction, we detect a dramatic increase in RNA concentration of all taRNA 
variants, yet we only observe lOx activation at the protein level (GFP) in the taR12:crRI2 
cognate pair (Fig. 3G). The same experiments were conducted with the crRL, 7, 10 variants, 
and at high taRNA levels, GFP activation is only observed for the taRlO-crRlO cognate pair. 
Thus, these results indicate that taRNA-crRNA complexes, which interact and undergo 
structural rearrangements to expose the RBS, rely on highly specific cognate RNA pairings. 
[00204] These constructs were subsequently used to prepare DNA templates by PCR for 
in vitro transcription of RNA fragments. The transcribed RNAs were produced from the T7 
promoter, and all taRNA-crRNA pairs were investigated to assess the in vitro specificity of 
interactions. 

[00205] We first conducted preliminary experiments using fixed concentrations of cis- 
repressive and /ra^-activating RNAs. The x-axis in Fig 6 corresponds to elution time and 
can be mapped to nucleotide sequences of cis-repressive RNAs using dideoxy sequencing 
protocols (39). We found that the time intervals 165-185 min and 210-230 min correspond 
to truncated transcripts and full length transcripts, respectively. It is remarkable that the 
cognate pairs (i.e., taR7-crR7, taRlO-crRlO, and taR12crR12) have substantial peaks 
corresponding to taRNA-crRNA complex, while the non-cognate pairs show almost no 
interaction. Therefore, we expect that: (i) the equilibrium association constants of the 
cognate pairs have much higher values those of the non-cognate pairs and (ii) in order to 
determine the equilibrium constants for non-cognate pairs, one must use an excess of taRNA 
to obtain a detectable amount of the taRNA-crRNA complex. 

[00206] Determination of equilibrium association constants for complexes between 
taRNA and crRNA was performed as described above. Reverse transcription profiles were 
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obtained for each of nine taRNA-crRNA pairs at six different concentrations of taRNA 
Figure 7 is an example profile, specifically for the taR7-crR12 pair. The peaks are: 92 
minutes (primer); 170-180 minutes (due to termination on cis repressive secondary 
structure); 180-190 minutes (termination on the taRNA-crRNA complex); 210-220 minutes 
(minor termination on additional secondary structure); 240-250 minutes (full length reverse 
transcript of crRNA). Each curve was integrated between 1 80 and 200 minutes and between 
210 and 250 minutes using Fragment Manager (Pharmacia Biotech). The ratio of the 
integrals are equal to the ratio of equilibrium concentrations of the taRNA-crRNA complex 
and free crRNA, respectively. 

[00207] From these data, the equilibrium dissociation constant was calculated as in 
reference 50, namely, the equilibrium dissociation constant KD for the reaction 
cr + ta <---> cr ° ta is KD = [cr][ta]/[cr ° ta], where square brackets denote equilibrium 
concentrations. If CR and TA correspond to the initial concentrations of crRNA and taRNA, 
respectively, then, CR = [cr] + [cr . ta] and TA = [ta] + [cr - ta]. Alternatively, [cr . ta]/CR 
is equal to x = Sc/(S C + S/), where S c and S/are the peak areas of the complex and the full 
length transcript, respectively. Therefore, x *K D = (1 - *)(TA - x . CR). Thus, K D is equal to 
the slope of the linear regression of TA - x » CR versus xl{\ - x). In Figure 8 we show an 
example calculation for the crR12-taR7 pair. Here K D is 1 .03 jjM, and Ka, which is the 
inverse ofK Di is 9.7* \0 S M'\ 

[00208] We were able to measure the equilibrium association constants for the 7, 10, and 
12 pairs. The association constants (Table 2) of cognate pairs (i.e., ta7-cr7, talO-crlO, and 
tal2-crl2) demonstrate greater than lOx higher values than non-cognate pairs (i.e., tal0-cr7, 
tal2-crl0, etc.). These data are consistent with the measurements of fold change of 
fluorescence, in which the target pairs show a remarkable increase in gene expression. 
Interestingly, the taR12-crR12 pair had the biggest fold change of fluorescence, although the 
association constants of all cognate pairs were of the same order of magnitude. The 
discrepancy we observe may be caused by differing conditions of RNA-RNA interaction 
between in vitro and in vivo studies. In addition, other factors, such as concentration of 
small ions or presence of proteins in the cell, may influence these interactions, hi principle, 
the in vitro studies show that the taRNA-crRNA interaction for the non-cognate pairs is not 
thermodynamically favorable when compared to the cognate pairs. 
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[002091 Table 7: Real-competitive PGR Assay Design. 



Assay: 
16SrRNA 


PCR Primer 1 : 5 '-ACGTTGGATGGGAGACTGCCAGTGATAAAC 
(SEQIDNO: 33) 

PCR Primer 2: 5-ACGTTGGATGTGTAGCCCTGGTCGTAAGG 
(SEQIDNO: 34) 

Extension Primer: 5-GAGGAAGGTGGGGATGACGT (SEQ ID NO: 36) 
Terminator Mix: CGT 

Pnmneti tnr Sea ■ 5-TfiTA GCCCTGGTCGTAAGGGCCATGATG- 

ACTTCACGTCATCCCC ACCTTCCTCCAG- 
TTTATCACTGGCAGTCTCC (SEQ ID NO: 37) 


Assay: 
crRNA 


PCR Primer 1: 5 ' - ACGTTGGATGGGAGAGGGTGAAGGTGATGC 
(SEQ ED NO: 38) 

PCR Primer 2: S'-ACGTTGGAAGAGGTAGTTTTCCAGTAGTGC 
(SEQIDNO: 39) 

Extension Primer: 5-CATACGGAAAACTTACCCTT (SEQ ID NO: 40) 
Terrninator Mix: ACT 

Competitor Seq: 5'-TGTAGCCCTGGTCGTAAGGGCCATGATGAC- 
TTCACGTCATCCCCACCTTCCTCCAGTTTAT- 
CACTGGCAGTCTCC (SEQ ID NO: 41) 


Assay: 
taKNA 


PCRPrimer 1: 5 '-ACGTTGGATGTTTCTCCATAGTCGACACCC 
(SEQIDNO: 42) 

PCR Primer 2: 5 '-ACGTTGGATGCTGCCGCCAGGCATCTAG AG 
(SEQIDNO: 43) 

Extension Primer: 5 '-GAAAATTAACTTACTACTACC (SEQ ID NO: 44) 
Terminator Mix: CGT 

Competitor Seq: Plasmid construct taR 12 (for taR L, 10) or 
Plasmid construct taRL (for taR 12) 



64 



WO 2004/046321 



PCT/US2003/036506 



[00210] Table 8: Primers for In Vitro PCR Amplification. 

[00211] 17 = 5 '-TAATACGACTCACTATAGG-3 ' (SEQ ID NO: 45). The same set of 
primers could be used for all crRNA variants because they all contained the same 5 s and 3' 
ends. Due to variable 5' sequences on the taRNA constructs, unique primers were designed 
for each PCR amplification. The same reverse primer was used in taRNA PCR reactions. 



Construct 


PCR Primer (forward) 


crR7, 10, 12 


S^ATTACTCGAG^-TCAGCAGGACGCACTGACC (SEQ ID NO: 46) 


taR7 


5 '-ATTACTCGAG-T7-ACCCAAATCCTAGCGGAG (SEQ ED NO: 47) 


taRlO 


5 * - ATTACTCGAG-T7-ACCCAAATTCATGAGCAG ATTG (SEQ ID NO: 48) 


taR12 


5 '-ATTACTCGAG-T7-ACCCAAATCCAGGAGGTG (SEQ ID NO: 49) 




Construct 


PCR Primer (reverse) 


crR7, 10, 12 


5 '-GTCCAAGCTTTTATTTGTATAGTTCATCCA (SEQ ID NO: 50) 


taR7 




taRlO 


5'-ACCACCGCGCTACTG (SEQ ID NO: 51) 


taR12 





[00212] Example 4: Alternate promoter systems demonstrate modular nature of system 
[00213] One advantageous feature of the present invention is its modular nature, in that it 
does not require use of specific promoters and does not target specific coding sequences. To 
demonstrate the modular nature of the system, the pBAD and PL(tetO) promoters were 
replaced with the PL(tetO) and PL(lacO) promoeres (36), respectively. In this scheme, 
PL(lacO) drives the expression of crR12 whereas PL(tetO) produces taR12. Similar to the 
system described in the examples above, we observe near autofluorescence levels of 
repression from crR12 GFP expression. Through the use of different promoters, we also 
demonstrate that the riboregulators functions independent of specific promoters and thus can 
be utilized with any promoter of choice. 

[00214] In the new riboregulator system, we chose to transcribe taR12 from the following 
six different positions relative to the transcription start site (36) of PL(tetO): +1, +3, +5, 
+19, +21, and +23. No detectable activation was observed in the +1, +19, +21, and +23 
variants; however, the +3 and + 5 variants demonstrated 9x and 13x GFP activation, 
respectively. No detectable activation was observed in the +1, +19, +21, and +23 variants; 
however, the +3 and + 5 variants demonstrated 9x and 13x GFP activation, respectively. 
These data reveal an important mechanistic feature of this system: the taRNA, which targets 
the consensus loop of the crRNA, sensitively depends on an accessible 5 ! linear 
complementary sequence in the crRNA. While not wishing to be bound by any theory, it is 
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possible that an elongated (+19, +21, +23), or truncated (+1), 5 1 end on the taRNA interferes 
with taRNA-crRNA interaction, preventing stable intermolecular duplex formation. We 
note also that the +1 variant lacks the YUNR motif, further suggesting the importance of this 
sequence element. 

i [00215] Example 5: Variant trans-activating UNA elements reveal important structural 
features 

[00216] We designed several experiments to determine the effect of alternations in the 
taR12 and/or crR12 sequences on the lOx activation observed in the taR12-crR12 
riboregulator pair. In an attempt to construct a hairpin stem-loop that is more susceptible to 

i open complex formation upon taR12 induction, we first decreased the number of base-pairs, 
maintaining three dispersed bulges, in the cis stem sequence of crR12. While cells 
containing these variants exhibited similar levels of activation to the original constructs, the 
dynamic range was significantly reduced due to less stable stem loops resulting in elevated 
low states. In an effort to increase the cellular concentration of the frans-activating RNA, 
we introduced a previously described (32, 42) stabilizer element at the 5' end of taRll 
Cultures containing the 5-stabilized taR12 transcripts at high concentrations show no 
activated state above the repressed state established by crR12, suggesting that this stabilizer 
element may interfere with taR12 recognition of its loop target on crR12. This result, which 
is consistent with the results obtained using taRNAs transcribed from sites at positions +19, 

► +21, +23 from the transcription start site (see Example 4) suggests that it is preferable to 
avoid an overly long impaired sequence at the 5* end of the taRNA in order to preserve 
fra/w-activation. 

[00217] Next, we pursued two approaches to generate a more stable taR12-crR12 duplex. 
First, we created three additional taR12 variants with greater than 95% and 100% sequence 
complementarity to the crRNA. Second, we constructed three more taR12 variants such that 
the 3' end of the taR12 stem, which is exposed in duplex formation, binds to the 5 ! UTR 
directly upstream of the cis sequence. Both sets of variants showed no detectable increase in 
the lOx level of activation Table 6 lists the taRNA variants that were produced. 
[00218] The foregoing description is to be understood as being representative only and is 
not intended to be limiting. Variations on the designs of cisltrans riboregulators described 
herein, and alternative methods for making and using them will be apparent to one of skill in 
the art and are intended to be included within the accompanying claims. 
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We claim: 

1 . An engineered nucleic acid molecule comprising: 

(i) a first stem-forming portion; 

(ii) a second stem-forming portion, wherein the two stem-forming portions 
are complementary or substantially complementary, and 

(iii) a non-stem- forming portion that forms a loop connecting the 3' end of 
the first stem-forming portion and the 5' end of the second stem-forming portion, 
wherein the engineered nucleic acid molecule forms a stem-loop structure that 
represses translation when positioned upstream of an open reading frame (ORF). 

2. The nucleic acid molecule of claim 1, wherein the nucleic acid molecule is 
positioned upstream of the ORF. 

3. The nucleic acid molecule of claim 1, wherein the first and second stem-forming 
portions are substantially complementary. 

4. The nucleic acid molecule of claim 1, wherein at least a portion of the first stem- 
forming portion is complementary or substantially complementary to a ribosome 
binding site (RBS). 

5. The nucleic acid molecule of claim 1, wherein at least a portion of the first stem- 
forming portion is complementary or substantially complementary to a Kozak 
consensus sequence. 

6. The nucleic acid molecule of claim 1, wherein the sequence of the second stem- 
forming portion comprises an RBS. 

7. The nucleic acid molecule of claim 1, wherein the sequence of the non-stem-forming 
portion comprises YUNR. 

8. The nucleic acid molecule of claim 1, wherein the non-stem forming portion is 4, 5, 
6, 7, 8, 9, 10, 11, or 12 nucleotides in length. 



9. 



The nucleic acid molecule of claim 1, wherein the non-stem forming portion is 
between 13 and 50 nucleotides in length, inclusive. 



WO 2004/046321 



PCT/DS2003/036506 



10. The nucleic acid molecule of claim 1, wherein the length of the stem is between 4 
and 100 nucleotides, inclusive. 

1 1 . The nucleic acid molecule of claim 1, wherein the length of the stem is between 6 
and 50 nucleotides, inclusive. 

12. The nucleic acid molecule of claim 1, wherein the length of the stem is between 12 
and 30 nucleotides, inclusive. 

13. The nucleic acid molecule of claim 1, wherein the length of the stem is 
approximately 19 nucleotides. 

14. The nucleic acid molecule of claim 1, wherein the stem exhibits at least 66% 
complementarity. 

15. The nucleic acid molecule of claim 1, wherein the stem exhibits between 75 and 95% 
complementarity. 

16. The nucleic acid molecule of claim 1, wherein the stem exhibits approximately 85% 
complementarity. 

17. The nucleic acid molecule of claim 1 , wherein the stem includes at least one area of 
non-complementarity. 

18. The nucleic acid molecule of claim 17, wherein the stem includes at least one bulge. 

19. The nucleic acid molecule of claim 1, wherein the stem includes at least two 
dispersed areas of non-complementarity. 

20. The nucleic acid molecule of claim 19, wherein the stem includes at least two 
dispersed bulges. 

2 1 . The nucleic acid molecule of claim 1 , wherein the stem includes at least three 
dispersed areas of non-complementarity. 

22. The nucleic acid molecule of claim 21, wherein the stem includes at least three 
dispersed bulges. 
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23. The nucleic acid molecule of claim 1, wherein the nucleic acid molecule forms a 
single stable stem. 

24. The nucleic acid molecule of claim 1, wherein the nucleic acid molecule represses 
translation in the absence of a ligand. 

25. The nucleic acid molecule of claim 1, wherein the nucleic acid molecule is 
composed of RNA. 

26. The nucleic acid molecule of claim 1, wherein the nucleic acid molecule is 
composed of DNA. 

27. The nucleic acid molecule of claim 1, wherein the nucleic acid molecule is 
composed of DNA and RNA. 

28. The nucleic acid molecule of claim 1, wherein the nucleic acid molecule comprises a 
nucleotide analog. 

29. The nucleic acid molecule of claim 1 , wherein the first stem-forming portion 
comprises a sequence complementary or substantially complementary to a sequence 
in the 5' portion of an ORF. 

30. The nucleic acid molecule of claim 1, wherein the nucleic acid molecule comprises a 
start codon. 

3 1 . The nucleic acid molecule of claim 30, wherein the nucleic acid molecule comprises 
a spacer comprising one or more nucleotides between the 3' end of the second stem- 
forming portion and the start codon. 

32. The nucleic acid molecule of claim 30, wherein all or part of the start codon is 
located within the second stem-foiming portion. 

33. The nucleic acid molecule of claim 1, wherein the nucleic acid molecule comprises 
one or more nucleotides upstream of the 5' end of the first stem-forming portion. 

34. The nucleic acid molecule of claim 1, wherein the nucleic acid molecule comprises 
between 5 and 50 nucleotides upstream of the 5' end of the first stem-forming 
portion. 
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35. The nucleic acid molecule of claim 1, wherein the nucleic acid molecule comprises a 
ligand binding domain. 

36. The nucleic acid molecule of claim 1, further comprising a third stem-forming 
portion that is complementary or substantially complementary to the second stem- 
forming portion, wherein the first and third stem-forming portions form alternate 
stem-loop structures with the second stem-forming portion. 

37. The nucleic acid molecule of claim 36, wherein the first and third stem-forming 
portions comprise a portion that is complementary or substantially complementary to 
anRBS. 

38. The nucleic acid molecule of claim 1, wherein the nucleic acid molecule represses 
translation by at least 80%. 

39. The nucleic acid molecule of claim 1, wherein the nucleic acid molecule represses 
translation by at least 90%. 

40. The nucleic acid molecule of claim 1, wherein the nucleic acid molecule represses 
translation by at least 98%. 

41. The nucleic acid molecule of claim 1, wherein the nucleic acid molecule has the 
sequence of crRlO. 

42. A DNA construct that comprises a template for transcription of the nucleic acid 
molecule of claim 41, wherein the nucleic acid molecule is composed of RNA. 

43. The nucleic acid molecule of claim 1, wherein the nucleic acid molecule has the 
sequence of crR12. 

44. A DNA construct that comprises a template for transcription of the nucleic acid 
molecule of claim 43, wherein the nucleic acid molecule is composed of RNA. 

45. The nucleic acid molecule of claim 1, wherein the nucleic acid molecule is a variant 
of crRlO and differs from crRlO by 12 or less nucleotides and includes at least 3 
dispersed areas of non-complementarity. 
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46. A DNA construct that comprises a template for transcription of the nucleic acid 
molecule of claim 45. 

47. The nucleic acid molecule of claim 1, wherein the nucleic acid molecule is a variant 
of crR12 and differs from crR12 by 12 or less nucleotides and includes at least 3 
dispersed areas of non-complementarity. 

48. A DNA construct that comprises a template for transcription of the nucleic acid 
molecule of claim 47, wherein the nucleic acid molecule is composed of RNA. 

49. A DNA construct that comprises a template for transcription of the nucleic acid 
molecule of claim 1, wherein the nucleic acid molecule is composed of RNA. 

50. A cell comprising the DNA construct of claim 49. 

51. A transgenic non-human mammal comprising the DNA construct of claim 49. 

52. A plasmid comprising the DNA construct of claim 49. 

53. The plasmid of claim 52, wherein the plasmid comprises a promoter operably linked 
to the template for transcription of the nucleic acid molecule. 

54. The plasmid of claim 53, wherein the promoter is inducible. 

55. The plasmid of claim 53, wherein the promoter is synthetic. 

56. The plasmid of claim 53, wherein the promoter is endogenous to a prokaryotic or 
eukaryotic cell. 

57. The plasmid of claim 53, wherein the promoter is responsive to an environmental or 
developmental signal. 

58. The plasmid of claim 53, wherein the promoter functions in prokaryotic cells. 

59. The plasmid of claim 53, wherein the promoter functions in eukaryotic cells. 

60. The plasmid of claim 52, wherein the nucleic acid molecule comprises a start codon 
and wherein the plasmid comprises a restriction site downstream of the template for 
transcription of the start codon. 
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61 . A cell comprising the plasmid of claim 52. 

62. An engineered nucleic acid molecule comprising: 

(i) a first stem-forming portion; 

(ii) a second stem-forming portion; and 

(iii) a non-stem-forming portion, wherein the non-stem-forming portion 
connects the 3* end of the first stem-forming portion and the 5* end of the second 
stem-forming portion to form a loop, 

and wherein a portion of the nucleic acid molecule is complementary or 
substantially complementary, to a portion of a cognate nucleic acid molecule of 
claim 1. 

63. The nucleic acid molecule of claim 62, wherein the nucleic acid molecule comprises 
a portion comprising the sequence YNAR positioned 5 ' to the 5 ' portion of the first 
stem-forming sequence. 

64. The nucleic acid molecule of claim 62, wherein the length of the stem is between 6 
and 50 nucleotides. 

65. The nucleic acid molecule of claim 62, wherein the length of the stem is between 12 
and 30 nucleotides. 

66. The nucleic acid molecule of claim 62, wherein the length of the stem is 
approximately 19 nucleotides. 

67. The nucleic acid molecule of claim 62, wherein the two stem-forming portions 
exhibit at least 66% complementarity. 

68. The nucleic acid molecule of claim 62, wherein the two stem-forming portions 
exhibit between 75 and 95% complementarity. 

69. The nucleic acid molecule of claim 62, wherein the two stem-forming portions 
exhibit approximately 85% complementarity. 

70. The nucleic acid molecule of claim 62, wherein the stem includes at least one area of 
non-complementarity. 
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71. The nucleic acid molecule of claim 70, wherein the stem includes at least one bulge. 

72. The nucleic acid molecule of claim 62, wherein the stem includes at least two 
dispersed areas of non-complementarity. 

73 . The nucleic acid molecule of claim 72, wherein the stem includes at least two 
dispersed bulges. 

74. The nucleic acid molecule of claim 62, wherein the stem includes at least three 
dispersed areas of non-complementarity. 

75. The nucleic acid molecule of claim 74, wherein the stem includes at least three 
dispersed bulges. 

76. The nucleic acid molecule of claim 62, wherein the nucleic acid molecule forms a 
single stem. 

77. The nucleic acid molecule of claim 62, wherein the nucleic acid molecule is 
composed of RNA. 

78. The nucleic acid molecule of claim 62, wherein the nucleic acid molecule is 
composed of DNA. 

79. The nucleic acid molecule of claim 62, wherein the nucleic acid molecule is 
composed of DNA and RNA. 

80. The nucleic acid molecule of claim 62, wherein the nucleic acid molecule comprises 
a nucleotide analog. 

81. The nucleic acid molecule of claim 62, wherein the nucleic acid molecule comprises 
a ligand binding domain. 

82. The nucleic acid molecule of claim 62, wherein the nucleic acid molecule activates 
translation of an mRNA whose translation is repressed by a cognate cw-repressive 
nucleic acid molecule. 

83. The nucleic acid molecule of claim 82, wherein the nucleic acid molecule activates 
translation by at least 5 fold. 
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84. The nucleic acid molecule of claim 82, wherein the nucleic acid molecule activates 
translation by at least 10 fold. 

85. The nucleic acid molecule of claim 82, wherein the nucleic acid molecule activates 
translation by at least 19 fold. 

86. The nucleic acid molecule of claim 62, wherein the nucleic acid molecule has the 
sequence oftaRlO. 

87. A DNA construct that comprises a template for transcription of the nucleic acid 
molecule of claim 86, wherein the nucleic acid molecule is composed of RNA. 

88. The nucleic acid molecule of claim 62, wherein the nucleic acid molecule has the 
sequence of taR12. 

89. A DNA construct that comprises a template for transcription of the nucleic acid 
molecule of claim 88, wherein the nucleic acid molecule is composed of RNA. 

90. The nucleic acid molecule of claim 62, wherein the nucleic acid molecule is a variant 
of taRlO and differs from taRlO by 12 or less nucleotides and includes at least 3 
dispersed areas of non-complementarity. 

• 91 . A DNA construct that comprises a template for transcription of the nucleic acid 

molecule of claim 90, wherein the nucleic acid molecule is composed of RNA. 

92. The nucleic acid molecule of claim 62, wherein the nucleic acid molecule is a variant 
of taR12 and differs from taR12 by 12 or less nucleotides and includes at least 3 

• dispersed areas of non-complementarity. 

93. A DNA construct that comprises a template for transcription of the nucleic acid 
molecule of claim 92, wherein the nucleic acid molecule is composed of RNA. 

94. A DNA construct that comprises a template for transcription of the nucleic acid 
molecule of claim 62, wherein the nucleic acid molecule is composed of RNA. 

95. The DNA construct of claim 94, further comprising a template for transcription of 
the nucleic acid molecule of claim 1, wherein the nucleic acid molecule is composed 
of RNA. 
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96. A cell comprising the DNA construct of claim 94. 

97. A transgenic non-human mammal comprising the DNA construct of claim 94. 

98. A plasmid comprising the DNA construct of claim 94. 

99. The plasmid of claim 98, wherein the plasmid comprises a promoter operably linked 
to the template for transcription of the nucleic acid molecule. 

100. The plasmid of claim 98, wherein the promoter is inducible. 

101 . The plasmid of claim 98, wherein the promoter is synthetic. 

102. The plasmid of claim 98, wherein the promoter is endogenous to a prokaryotic or 
eukaryotic cell. 

103. The plasmid of claim 98, wherein the promoter is responsive to an environmental or 
developmental signal. 

104. The plasmid of claim 98, wherein the promoter functions in prokaryotic cells. 

105. The plasmid of claim 98, wherein the promoter functions in eukaryotic cells. 

1 06. A cell comprising the plasmid of claim 98. 

1 07. The plasmid of claim 98, further comprising the DNA construct of claim 49. 

108. An engineered nucleic acid molecule comprising a sequence that is complementary 
or substantially complementary to a sequence located within a region comprising the 
5' UTR of an mRNA and the first 20 nucleotides of the open reading frame, wherein 
the molecule represses translation of the open reading frame when positioned within 
the 5' UTR of the mRNA by forming a stem-loop structure with the sequence to 
which it is complementary or substantially complementary. 

1 09. The nucleic acid molecule of claim 108, wherein the nucleic acid molecule 
comprises a YUNR sequence within a portion of the molecule that forms the loop in 
the stem-loop structure. 
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110. The nucleic acid molecule of claim 1 08, wherein the nucleic acid molecule 
comprises a sequence that is complementary or substantially complementary to an 
RBS. 

111. The nucleic acid molecule of claim 1 08, wherein the nucleic acid molecule 
comprises a sequence that is complementary or substantially complementary to a 
Kozak consensus sequence. 

112. An engineered nucleic acid molecule comprising: 

(i) a first stem-forming portion; 

(ii) a second stem-forming portion, wherein the two stem-forming portions 
are complementary or substantially complementary, and wherein the second stem- 
forming portion comprises a 5* portion of an open reading frame and the first stem- 
forming portion comprises a portion that is complementary or substantially 
complementary to the 5* portion of the open reading frame; 

(iii) a non-stem-forming portion that forms a loop connecting the 3' end of 
the first stem-forming portion and the 5* end of the second stem- forming portion, 
wherein the engineered nucleic acid molecule forms a stem-loop structure that 
represses translation of the open reading frame (ORF). 

1 13. The nucleic acid molecule of claim 1 12, wherein the nucleic acid molecule 
comprises a YUNR sequence within the non-stem-forming portion. 

114. The nucleic acid molecule of claim 1 12, wherein the first stem-forming portion 
comprises a sequence that is complementary or substantially complementary to an 
RBS. 

115. The nucleic acid molecule of claim 1 12, wherein the first stem-forming portion 
comprises a sequence that is complementary or substantially complementary to a 
Kozak consensus sequence. 

116. A system for control of gene expression comprising: 
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(i) a first nucleic acid molecule comprising a ris-repressive sequence element 
upstream of an open reading frame (ORF), wherein the first nucleic acid molecule 
forms a stem-loop structure that represses translation of the ORF; and 

(ii) a second nucleic acid molecule comprising first and second stem-forming 
portions and a non-stem-forming portion, wherein the non-stem-forming portion 
connects the 3' end of the first stem-forming portion and the 5* end of the second 
stem-forming portion to form a loop, and wherein a portion of the second nucleic 
acid molecule is complementary or substantially complementary to a portion of the 
first nucleic acid molecule and interacts with the first nucleic acid molecule to 
derepress translation of the ORF. 

117. The system of claim 116, wherein the first nucleic acid molecule represses 
translation by at least 80%. 

118. The system of claim 116, wherein the first nucleic acid molecule represses 
translation by at least 90%. 

119. The system of claim 116, wherein the first nucleic acid molecule represses 
translation by at least 98%. 

120. The system of claim 116, wherein the second nucleic acid molecule activates 
translation by at least 5 fold. 

121. The system of claim 116, wherein the second nucleic acid molecule activates 
translation by at least 10 fold. 

122. The system of claim 116, wherein the second nucleic acid molecule activates 
translation by at least 19 fold. 

123. The system of claim 1 16, wherein the first and second nucleic acid molecules are 
composed of RNA, 

124. The system of claim 116, wherein the first and second nucleic acid molecules are 
composed of DNA. 

125. The system of claim 1 16, wherein the first and second nucleic acid molecules are 
composed of DNA and RNA. 
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126. The system of claim 116, wherein the nucleic acid molecule is positioned upstream 
of the ORF. 

127. The system of claim 1 16, wherein the first nucleic acid molecule comprises: 

(i) a first stem-forming portion; 

(ii) a second stem-forming portion, wherein the two stem-forming portions 
are complementary or substantially complementary, and 

(iii) a non-stem-forming portion that forms a loop connecting the 3' end of 
the first stem-forming portion and the 5' end of the second stem- forming portion, 
wherein the engineered nucleic acid molecule forms a stem-loop structure that 
represses translation when positioned upstream of an open reading frame (ORF). 

128. The system of claim 127, wherein the first and second stem-forming portions are 
substantially complementary. 

129. The system of claim 116, wherein at least a portion of the first nucleic acid molecule 
is complementary or substantially complementary to a ribosome binding site (RBS). 

130. The system of claim 116, wherein at least a portion of the first nucleic acid molecule 
is complementary or substantially complementary to a Kozak consensus sequence. 

131. The system of claim 1 27, wherein the sequence of the second stem-forming portion 
comprises an RBS. 

132. The system of claim 127, wherein the sequence of the non-stem-forming portion 
comprises YUNR. 

133. The system of claim 127, wherein the non-stem forming portion is 4, 5, 6, 7, 8, 9, 10, 
1 1, or 12 nucleotides in length. 

134. The system of claim 127, wherein the non-stem forming portion is between 13 and 
50 nucleotides in length, inclusive. 

135. The system of claim 116, wherein the length of the stem is between 4 and 100 
nucleotides, inclusive. 
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136. The system of claim 116, wherein the length of the stem is between 6 and 50 
nucleotides, inclusive. 

137. The system of claim 116, wherein the length of the stem is between 12 and 30 
nucleotides, inclusive. 

138. The system of claim 116, wherein the length of the stem is approximately 19 
nucleotides. 

139. The system of claim 116, wherein the stem exhibits at least 66% complementarity. 

140. The system of claim 116, wherein the stem exhibits between 75 and 95% 
complementarity. 

141 . The system of claim 1 16, wherein the stem exhibits approximately 85% 
complementarity. 

142. The system of claim 1 16, wherein the stem includes at least one area of non- 
complementarity. 

143. The system of claim 142, wherein the stem includes at least one bulge. 

144. The system of claim 1 16, wherein the stem includes at least two dispersed areas of 
non-complementarity. 

145. The system of claim 144, wherein the stem includes at least two dispersed bulges. 

146. The system of claim 116, wherein the stem includes at least three dispersed areas of 
non-complementarity. 

147. The system of claim 146, wherein the stem includes at least three dispersed bulges. 

148. The system of claim 1 16, wherein the nucleic acid molecule forms a single stable 
stem. 

149. The system of claim 116, wherein the nucleic acid molecule represses translation in 
the absence of a ligaftd. 
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1 50. The system of claim 116, wherein the first stem-forming portion comprises a 
sequence complementary or substantially complementary to a sequence in the 5' 
portion ofanORF. 

151. The system of claim 127, wherein the nucleic acid molecule comprises a start codon. 

• >. 

152. The system of claim 151, wherein the nucleic acid molecule comprises a spacer 
comprising one or more nucleotides between the 3' end of the second stem-forming 
portion and the start codon. 

153. The system of claim 151, wherein all or part of the start codon is located within the 
second stem-forming portion. 

154. The system of claim 116, wherein the nucleic acid molecule comprises one or more 
nucleotides at the 5' end that do not participate in the stem-loop structure. 

155. The system of claim 116, wherein the nucleic acid molecule comprises between 5 
and 50 nucleotides upstream of the 5 5 end of the first stem-forming portion. 

156. The system of claim 116, wherein the nucleic acid molecule comprises a ligand 
binding domain. 

157. The system of claim 127, wherin the nucleic acid molecule comprises a third stem- 
forming portion that is complementary or substantially complementary to the second 
stem-forming portion, wherein the first and third stem-forming portions form 
alternate stem- loop structures with the second stem-forming portion. 

158. The system of claim 157, wherein the first and third stem-forming portions comprise 
a portion that is complementary or substantially complementary to an RBS. 

159. The system of claim 116, wherein the second nucleic acid molecule comprises a 
portion comprising the sequence YNAR positioned 5' to the 5' portion of the first 
stem-forming sequence. 

160. The system of claim 116, wherein the length of the stem formed by the two stem- 
forming portions of the second nucleicacid molecule is between 6 and 50 
nucleotides. 
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161. The system of claim 1 16, wherein the length of the stem formed by the two stem- 
forming portions of the second nucleic acid molecule is between 12 and 30 
nucleotides. 

1 62. The system of claim 1 1 6, wherein the length of the stem formed by the two stem- 
forming portions of the second nucleic acid molecule is approximately 19 
nucleotides. 

163. The system of claim 116, wherein the two stem-forming portions of the second 
nucleic acid molecule exhibit at least 66% complementarity. 

164. The system of claim 116, wherein the two stem-forming portions of the second 
nucleic acid molecule exhibit between 75 and 95% complementarity. 

165. The system of claim 116, wherein the two stem-forming portions of the second 
nucleic acid molecule exhibit approximately 85% complementarity. 

1 66. The system of claim 1 1 6, wherein the stem formed by the two stem-forming portions 
of the second nucleic acid molecule includes at least one area of non- 
complementarity. 

1 67. The system of claim 1 16, wherein the stem formed by the two stem- forming portions 
of the second nucleic acid molecule includes at least two dispersed areas of non- 
complementarity. 

1 68. The system of claim 116, wherein the stem formed by the two stem-forming portions 
of the second nucleic acid molecule includes at least three dispersed areas of non- 
complementarity. 

169. The system of claim 116, wherein the second nucleic acid molecule comprises a 
nucleotide analog. 

170. The system of claim 116, wherein the second nucleic acid molecule comprises a 
ligand binding domain. 
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171. The system of claim 1 1 6, wherein the first and second nucleic acid molecules 
interact so as to disrupt the stem-loop structure formed by the first nucleic acid 
molecule, thereby allowing a ribosome to gain access to a ribosome binding site. 

172. The system of claim 1 16, wherein the first nucleic acid molecule has the sequence of 
crRlO and the second nucleic acid molecule has the sequence of taRlO. 

173. The system of claim 1 16, wherein the first nucleic acid molecule has the sequence of 
crR12 and the second nucleic acid molecule has the sequence of taR12. 

174. The system of claim 1 16, wherein the first nucleic acid molecule has the sequence of 
crRlO or a variant of crRlO that differs from crRlO by 12 nucleotides or less and 
includes at least 3 dispersed areas of non-complementarity and the second nucleic 
acid molecule has the sequence of taRlO or a variant of taRlO that differs from 
taRlO by 12 nucleotides or less and includes at least 3 dispersed areas of non- 
complementarity. 

1 75. The system of claim 116, wherein the first nucleic acid molecule has the sequence of 
crR12 or a variant of crR12 that differs from crR12 by 12 nucleotides or less and 
includes at least 3 dispersed areas of non-complementarity and the second nucleic 
acid molecule has the sequence of taR12 or a variant of taR12 that differs from 
taR12 by 12 nucleotides or less and includes at least 3 dispersed areas of non- 
complementarity. 

176. The system of claim 116, wherein the first nucleic acid molecule and the second 
nucleic acid molecule have an equilibrium association constant between 0.8 x 10 7 
and 1.5 x 10 7 kcal/mol. 

1 77. A kit for allowing a user to regulate expression of a gene of choice comprising: 

(a) a first plasmid comprising 

(i) a template for transcription of a cis-repressive RNA element; and 

(ii) a promoter located upstream of the template for transcription of 
the cis-repressive RNA element; 

(b) a second plasmid comprising 
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(i) a template for transcription of a cognate trans-activating RNA 
element; and 

(ii) a promoter located upstream of the template for transcription of 
the trans-activating RNA element; and 

(c) one or more elements selected from the list consisting of: (i) one or more 
inducers; (ii) host cells; (iii) one or more buffers; (iv) an enzyme, e.g., a restriction 
enzyme; (v) DNA isolation reagents; (vi) DNA purification reagents; (vii) a control 
plasmid lacking a crRNA or taRNA sequence; (viii) a control plasmid containing a 
crRNA or taRNA sequence or both; (ix) sequencing primers; and (x) instructions for 
use. 

178. A kit for allowing a user to regulate expression of a gene of choice comprising: 

a plasmid comprising a template for transcription of a cis-repressive RNA 
element and a promoter located upstream of the template for transcription of the cis- 
repressive RNA element and further comprising a template for transcription of a 
cognate trans-activating RNA element and a promoter located upstream of the 
template for transcription of the cognate trans-activating RNA element; and 

one or more elements selected from the list consisting of: (i) one or more 
inducers; (ii) host cells; (iii) one or more buffers; (iv) an enzyme, e.g., a restriction 
enzyme; (v) DNA isolation reagents; (vi) DNA purification reagents; (vii) a control 
plasmid lacking a crRNA or taRNA sequence; (viii) a control plasmid containing a 
crRNA or taRNA sequence or both; (ix) sequencing primers; and (x) instructions for 
use. 

179. A kit for allowing a user to regulate expression of a gene of choice comprising: 

(a) a first plasmid comprising 

(i) a template for transcription of a cis-repressive RNA element; and 

(ii) a promoter located upstream of the template for transcription of 
the cis-repressive RNA element; 

(b) a second plasmid comprising 

(i) a template for transcription of a cognate trans-activating RNA 
element; and 

(ii) a promoter located upstream of the template for transcription of 
the trans-activating RNA element; 
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(c) a third plasmid comprising a template for transcription of a cis-repressive 
RNA element and a promoter located upstream of the template for transcription of 
the cis-repressive RNA element and further comprising a template for transcription 
of a cognate trans-activating RNA element and a promoter located upstream of the 
template for transcription of the cognate trans-activating RNA element; and 

(d) one or more elements selected from the list consisting of: (i) one or more 
inducers; (ii) host cells; (iii) one or more buffers; (iv) an enzyme, e.g., a restriction 
enzyme; (v) DNA isolation reagents; (vi) DNA purification reagents; (vii) a control 
plasmid lacking a crRNA or taRNA sequence; (viii) a control plasmid containing a 
crRNA or taRNA sequence or both; (ix) sequencing primers; and (x) instructions for 
use. 

180. A kit comprising 

one or more oligonucleotides comprising a crRNA sequence, one or more 
oligonucleotides comprising a taRNA sequence, or one or more oligonucleotides 
comprising a crRNA sequence and one or more oligonucleotides comprising a 
taRNA sequence, wherein the kit further comprises one or more items selected from 
the group consisting of: (i) one or more inducers; (ii) host cells; (iii) one or more 
buffers; (iv) an enzyme, e.g., a restriction enzyme; (v) DNA isolation reagents; (vi) 
DNA purification reagents; (vii) a control plasmid lacking a crRNA or taRNA 
sequence; (viii) a control plasmid containing a crRNA or taRNA sequence or both; 
(ix) sequencing primers; and (x) instructions for use. 

181. A method of regulating translation of an open reading frame comprising: 

introducing an engineered template for transcription of an mRNA into a cell 
and allowing mRNA trancription to occur resulting in a transcribed mRNA, wherein 
the template is engineered so that the transcribed mRNA comprises first and second 
nucleic acid elements that form a stem-loop structure that represses translation of the 
mRNA; and 

providing an engineered nucleic acid molecule that interacts with the mRNA 
so as to derepress translation of the mRNA to the cell 

1 82. The method of claim 181, wherein the engineered template comprises: 

(i) a first stem-forming portion; 
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(ii) a second stem-forming portion, wherein the two stem-forming portions 
are complementary or substantially complementary, and 

(iii) a non-stem-forming portion that forms a loop connecting the 3* end of 
the first stem-forming portion and the 5' end of the second stem-forming portion, 
wherein the engineered nucleic acid molecule forms a stem-loop structure that 
represses translation when positioned upstream of an open reading frame (ORP). 

183. The method of claim 182, wherein the first and second stem-forming portions are 
substantially complementary. 

184. The method of claim 182, wherein at least a portion of the first stem-forming portion 
is complementary or substantially complementary to a ribosome binding site . 

1 85. The method of claim 182, wherein at least a portion of the first stem-forming portion 
is complementary or substantially complementary to a Kozak consensus sequence. 

1 86. The method of claim 1 82, wherein the sequence of the second stem-forming portion 
comprises anRBS. 

1 87. The method of claim 1 82, wherein the sequence of the non-stem-forming portion 
comprises YUNR. 

188. The method of claim 1 82, wherein the non-stem forming portion is 4, 5, 6, 7, 8, 9, 
10, 11, or 12 nucleotides in length. 

1 89. The method of claim 1 82, wherein the non-stem forming portion is between 13 and 
50 nucleotides in length, inclusive. 

190. The method of claim 182, wherein the length of the stem is between 4 and 100 
nucleotides, inclusive. 

191. The method of claim 182, wherein the length of the stem is between 6 and 50 
nucleotides, inclusive. 

192. The method of claim 182, wherein the length of the stem is between 12 and 30 
nucleotides, inclusive. 
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193. The method of claim 182, wherein the length of the stem is approximately 19 
nucleotides. 

194. The method of claim 182, wherein the stem exhibits at least 66% complementarity. 

195. The method of claim 182, wherein the stem exhibits between 75 and 95% 
complementarity. 

196. The method of claim 182, wherein the stem exhibits approximately 85% 
complementarity. 

197. The method of claim 1 82, wherein the stem includes at least one area of non- 
complementarity. 

198. The method of claim 1 97, wherein the stem includes at least one bulge. 

1 99. The method of claim 1 82, wherein the stem includes at least two dispersed areas of 
non-complementarity. 

200. The method of claim 199, wherein the stem includes at least two dispersed bulges. 

201. The method of claim 182, wherein the stem includes at least three dispersed areas of 
non-complementarity. 

202. The method of claim 201, wherein the stem includes at least three dispersed bulges. 

203. The method of claim 182, wherein the first stem-forming portion comprises a 
sequence complementary or substantially complementary to a sequence in the 5' 
portion of an ORF. 

204. The method of claim 182, wherein the transcribed mRNA comprises a start codon 
and a spacer comprising one or more nucleotides between the 3' end of the second 
stem-forming portion and the start codon. 

205. The method of claim 204, wherein all or part of the start codon is located within the 
second stem-forming portion. 
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206. The method of claim 1 82, wherein the transcribed mRNA comprises one or more 
nucleotides upstream of the 5' end of the first stem-forming portion. 

207. The method of claim 182, wherein the transcribed mRNA comprises between 5 and 
50 nucleotides upstream of the 5' end of the first stem-forming portion. 

208. The method of claim 182, wherein the transcribed mRNA further comprises a third 
stem-forming portion that is complementary or substantially complementary to the 
second stem-forming portion, wherein the first and third stem-forming portions form 
alternate stem-loop structures with the second stem-forming portion. 

209. The method of claim 208, wherein the first and third stem-forming portions comprise 
a portion that is complementary or substantially complementary to an RBS. 

210. The method of claim 181, wherein the step of providing comprises: 

inducing transcription of the engineered nucleic acid molecule in the cell. 

211. The method of claim 181, wherein the step of providing comprises: 

delivering the engineered nucleic acid molecule exogenously. 

212. The method of claim 181, wherein the engineered nucleic acid molecule comprises: 

(i) a first stem-forming portion; 

(ii) a second stem-forming portion; and 

(iii) a non-stem-forming portion, wherein the non-stem-forming portion 
connects the 3' end of the first stem-forming portion and the 5' end of the second 
stem-forming portion to form a loop, 

and wherein a portion of the nucleic acid molecule is complementary or 
substantially complementary, to a portion of the transcribed mRNA. 

213. The method of claim 212, wherein the nucleic acid molecule comprises a portion 
comprising the sequence YNAR positioned 5* to the 5' portion of the first stem- 
forming sequence. 

214. The method of claim 212, wherein the length of the stem is between 4 and 100 
nucleotides, inclusive. 
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215. The method of claim 212, wherein the length of the stem is between 6 and 50 
nucleotides, inclusive. 

216. The method of claim 212, wherein the length of the stem is between 12 and 30 
nucleotides, inclusive. 

217. The method of claim 212, wherein the length of the stem is approximately 19 
nucleotides. 

218. The method of claim 212, wherein the two stem-forming portions exhibit at least 
66% complementarity. 

219. The method of claim 212, wherein the two stem-forming portions exhibit between 75 
and 95% complementarity. 

220. The method of claim 212, wherein the two stem-forming portions exhibit 
approximately 85% complementarity. 

221 . The method of claim 212, wherein the stem includes at least two dispersed areas of 
non-complementarity. 

222. The method of claim 221, wherein the stem includes at least two dispersed bulges. 

223. The method of claim 212, wherein the stem includes at least three dispersed areas of 
non-complementarity. 

224. The method of claim 223, wherein the stem includes at least three dispersed bulges. 

225. The method of claim 212, wherein the nucleic acid molecule forms a single stem. 

226. The method of claim 181, wherein the nucleic acid molecule is composed of RNA. 

227. The method of claim 181, wherein the nucleic acid molecule is composed of DNA. 

228. The method of claim 181, wherein the nucleic acid molecule is composed of DNA 
and RNA. 

229. The method of claim 181, wherein the nucleic acid molecule comprises a nucleotide 
analog. 
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230. The method of claim 181, wherein the nucleic acid molecule comprises a ligand 
binding domain. 

231. The method of claim 181, wherein the first nucleic acid molecule has the sequence of 
crRlO and the second nucleic acid molecule has the sequence of taRlO. 

232. The method of claim 181, wherein the first nucleic acid molecule has the sequence of 
crR12 and the second nucleic acid molecule has the sequence of taR12. 

233. The method of claim 181, wherein the first nucleic acid molecule has the sequence of 
crRlO or a variant of crRlO that differs from crRlO by 12 nucleotides or less and 
includes at least 3 dispersed areas of non-complementarity and the second nucleic 
acid molecule has the sequence of taRlO or a variant of taRlO that differs from 
taRlO by 12 nucleotides or less and includes at least 3 dispersed areas of non- 
complementarity. 

234. The method of claim 181, wherein the first nucleic acid molecule has the sequence of 
crR12 or a variant of crR12 that differs from crR12 by 12 nucleotides or less and 
includes at least 3 dispersed areas of non-complementarity and the second nucleic 
acid molecule has the sequence of taR12 or a variant of taR12 that differs from 
taR12 by 12 nucleotides or less and includes at least 3 dispersed areas of non- 
complementarity. 

235. The method of claim 181, wherein the first nucleic acid molecule and the second 
nucleic acid molecule have an equilibrium association constant between 0.8 x 10 7 
and 1.5 x 10 7 kcal/mol. 

236. A method of analyzing gene expression comprising: 

(i) repressing translation of an mRNA transcribed from a gene of interest 
using a cw-repressive RNA element; 

(ii) measuring expression of one or more genes; 

(iii) activating translation of the mRNA using a cognate fra/u-activating 
RNA element; and 

(iv) measuring expression of the one or more genes of step (ii). 
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237. The method of claim 236, wherein the steps of measuring gene expression comprise 
obtaining a gene expression profile using a microarray. 

238. A method of selecting a cognate pair of nucleic acid molecules for regulating 
translation comprising steps of: 

(i) providing one or more starting nucleic acid sequences; 

(ii) randomizing the sequence or sequences to generate one or more pools of 
randomized nucleic acid sequences; and 

(iii) employing in vitro selection to identify a candidate cognate nucleic acid 
pair comprising a repressive element that represses translation when positioned 
upstream of an ORF and an activating element that derepresses translation that is 
repressed by the candidate repressive element. 

239. The method of claim 238, further comprising the step of: 

(i) testing the repressive element by inserting it upstream of an open reading 
frame and measuring translation; or 

(ii) testing the activating element by determining whether it derepresses 
translation that is repressed by the repressive element; or 

(iii) performing both (i) and (ii). 

240. The method of claim 238, wherein the one or more nucleotide sequences are 
provided as DNA. 

241 . A method of selecting a cognate pair of nucleic acid molecules for regulating 
translation comprising steps of: 

(i) providing a starting sequence of a crRNA:taRNA cognate pair; and 

(ii) making one or more nucleotide additions, deletions, or substitutions in 
either or both sequences, such that the overall percentage of complementarity in the 
crRNA stem, the taRNA stem, and the crRNA:taRNA duplex remain within 10% of 
their starting values and the number of dispersed areas of non-complementarity in 
the crRNA stem, the taRNA stem, and the crRNA:taRNA duplex remain the same as 
their starting values. 

242. A collection of cognate m-repressive and /rojw-activating nucleic acid molecule 
pairs. 
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